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DESCRIPTION 
PROTEIN KINASES 

FIELD OF THE INVENTION 
The present invention relates to novel kinase polypeptides, nucleotide sequences 
encoding the novel kinase polypeptides, as well as various products and methods useful 
for the diagnosis and treatment of various kinase-related diseases and conditions. 

BACKGROUND OF THE INVENTION 
The following description of the background of the invention is provided to aid in 
understanding the invention, but is not admitted to be or to describe prior art to the 
invention. 

Cellular signal transduction is a fundamental mechanism whereby external stimuli 
that regulate diverse cellular processes are relayed to the interior of cells. One of the key 
biochemical mechanisms of signal transduction involves the reversible phosphorylation of 
proteins, which enables regulation of the activity of mature proteins by altering their 
structure and function. 

Protein phosphorylation plays a pivotal role in biological signal transduction. 
Among the biological functions controlled by protein phosphorylation are the following: 
cell division; differentiation and death (apoptosis); cell motility and cytoskeletal structure; 
control of DNA replication, transcription, splicing and translation; protein translocation 
events from the endoplasmic reticulum and Golgi apparatus to the membrane and 
extracellular space; protein nuclear import and export; regulation of metabolic reactions, 
etc. Abnormal protein phosphorylation is widely recognized to be causally linked to the 
etiology of many diseases including cancer as well as immunologic, neuronal and 
metabolic disorders. 

The most common phospho-acceptor amino acid residues are serine, threonine and 
tyrosine. Phosphorylation in histidine has also been observed in bacteria. The presence of 
a phosphate moeity modulates protein function in multiple ways. A common mechanism 
includes changes in the catalytic properties (V™, and K m ) of an enzyme leading to its 
activation or inactivation. A second widely recognized mechanism involves promoting 
protein-protein interactions. An example of this is the tyrosine autophosphorylation of the 
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ligand-activated EGF receptor tyrosine kinase. This event triggers the high-affinity 
binding to the phosphotyrosine residue on the receptor's C-terminal intracellular domain 
to the SH2 motif of the adaptor molecule Grb2. Grb2 in turn binds through its SH3 motif 
to a second adaptor molecule, such as SHC. The formation of this ternary complex 
acivates the signaling events that are responsible for the biological effects of EGF. Serine 
and threonine phosphorylation events have also being recently recognized to exert their 
biological function through protein-protein interaction events mediated by the high- 
affinity binding of phosphoserine and phosphothreonine to WW motifs present in a large 
.variety of proteins (Lu, PJ. et ai (1999) Science 283:1325-1328). A third important 
outcome of protein phosphorylation is changes in the subcellular localization of the 
substrate. As an example, nuclear import and export events in a large diversity of proteins 
are regulated by protein phosphorylation (Drier E.A. et ai (1999) Genes Dev 13: 556- 
568). 

Protein kinases are one of the largest families of eukaryotic proteins with several 
hundred known members. These proteins share a 250-300 amino acid domain that can be 
subdivided into 12 distinct subdomains that comprise the common catalytic core structure. 
These conserved protein motifs have recently been exploited using PCR-based and 
bioinformatic strategies leading to a significant expansion of the known kinases. Multiple 
alignment of the sequences in the catalytic domain of protein kinases and subsequent 
parsimony analysis permits their segregation into a dendrogram reflecting the relatedness 
of their catalytic domains (Fig. 1). In this manner, related kinases are clustered into 
distinct branches or subfamilies including: tyrosine kinases, cyclic-micleotide-dependent 
kinases, calcium/calmodulin kinases, cyclin-dependent kinases and MAP-kinases, serine- 
threonine kinase receptors, and several other less defined subfamilies. 

We have recently completed a systematic analysis of the protein kinases present in 
C elegans, the multicellular organism whose entire DMA sequence has been determined. 
We identified 473 unique kinase profiles including 398 full-length conventional kinases, 
and 20 additional proteins that may function as atypical protein kinases. (Plowman G.D. 
et ai (1999), Proc. Natl. Acad. Sci. 96:13603-13610). 

Using parsimony analysis, the protein kinases may be divided into 4 major groups: 
AGC, CAMK, CMGC and tyrosine kinases. In addition, there are a number of minor yet 
distinct families, including the STE and casein kinase 1, families related to worm- or 
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fungal-specific kinases, and a family designated "other" to represent several smaller 
families. In addition, we designate an "atypical" family to represent protein kinases whose 
catalytic domain has little or no primary sequence homology to conventional kinases, 
including the A6 kinases and P13 kinases. 

The AGC kinases are basic amino acid-directed enzymes that phosphorylate 
residues found proximal to Arg and Lys. Examples of this group are the cyclic nucleotide- 
dependent kinases, G protein kinases, NDR or DBF2 and the ribosomal S6 kinases. 

The CAMK group kinases are also basic amino acid-directed kinases. They include 
the Ca2+/calmodulin-regulated and AMP-dependent protein kinases, myosin light chain 
kinases, checkpoint 2 kinases (CHK2) and EMK-related protein kinases. The EMK family 
of STK are involved in the control of cell polarity, micotubule stability and cancer. One 
member of the EMK family, C-TAK1 has been reported to control entry into mitosis by 
activating Cdc25C which in tum dephosphorylates Cdc2. 

CMGC group kinases are "proline-directed" enzymes phosphorylating residues 
that exist in a proline-rich context. They include the cyclin-dependent kinases (CDKs), 
mitogen-activated kinases (MAPKs), GSK3s and CLKs. Most CMGC kinases have 
larger-than-average kinase domains owing to the presence of insertions within subdomains 
X and XI. 

The tyrosine kinase group encompass both cytoplasmic (i.e. src) as well as 
transmembrane receptor tyrosine kinases (i.e. EGF receptor). These kinases play a pivotal 
role in the signal transduction processes that mediate cell proliferation, differentiation and 
apoptotis. 

Group members that define smaller, yet distinct phylogenetic branches of 
conventional kinases include the elongation factor 2 kinases (ETFKs); homologues of the 
yeast sterile family kinases (STE) which refers to 3 classes of kinases which lie 
sequentially upstream of the MAPKs; mixed lineage kinases (MLKs); Lim-domain 
containing kinases (LIMKs); Calcium-calmodulin kinase kinases (CAMKK), dual-specific 
tyrosine kinases (DYRK), integrin receptor associated kinase (IRAK); testis-specific 
kinases (TSK); UNC-51 related kinases (UNC); several families that are close 
homologues to worm (C26C2.1 , YQ09, ZC581 .9, YFL033c, C24A1 .3), Drosophila 
(SLOB), or yeast (YDOD_sp, YGR262_sc) kinases, and others that are "unique" and 
don't cluster into any obvious family. 
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SUMMARY OF THE INVENTION 
Through a search of the EST database for homologies to the conserved catalytic 
kinase domain of protein kinases, hundreds of mammalian members of known and 
previously unidentified protein kinase families and groups have been identified as part of 
the present invention. Multiple alignment and parsimony analysis of the catalytic domain 
reveals that approximately half of these protein kinases cluster into 10 known groups, with 
the other half perhaps defining novel groups. Classification in this manner has proven 
highly accurate not only in predicting motifs present in the remaining non-catalytic portion 
of each protein, but also in their regulation, substrates, and signaling pathways. The 
present invention includes the partial or complete sequence of new protein kinases, their 
classification, predicted or deduced protein structure, and a strategy for elucidating their 
biologic and therapeutic relevance. 

Thus, a first aspect of the invention features an isolated, enriched, or purified 
nucleic acid molecule encoding a kinase polypeptide selected from the group consisting 
SEQ ED NO:122, SEQ ED NO:123, SEQ ID N0.124, SEQ ID NO:125, SEQ ID NO:126, 
SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO:131, 
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, 
SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ED NO: 140, SEQ ID NO: 141, 
SEQ ED NO:142, SEQ ED NO:143, SEQ ED NO:144, SEQ ID NO:145, SEQ ID NO:146, 
SEQ ED NO:147, SEQ ED NO:148, SEQ ED NO:149, SEQ ED NO:150, SEQ ED NO:151 , 
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:l 54, SEQ ED NO:155, SEQ ED NO:156, 
SEQ ED NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ED NO: 160, SEQ ID NO: 161, 
SEQ ED NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65 . SEQ ID NO: 1 66, 
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171 , 
SEQ ID NO: 172, SEQ ID NO:173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, 
SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , 
SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, 
SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 91 , 
SEQ ID NO: 1 99, SEQ ED NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, SEQ ID NOT 96, 
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, 
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, 
SEQ ID NO:207, SEQ ID NO:208, SEQ ED NO:209, SEQ ED NO:21 0, SEQ ID NO:21 1 , 
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SEQ ID NO:212, SEQ ID NO:213 5 SEQ ED NO:214, SEQ ID NO:215, SEQ ID NO:216, 
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, 
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, 
SEQ ID NO:227, SEQ ID NO:228, SEQ ED NO:229, SEQ ID NO:230, SEQ ID NO:23 1 , 
5 SEQ ID NO:232, SEQ ID NO:233, SEQ ED NO:234, SEQ ID NO:235, SEQ ID NO:236, 
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , 
and SEQ ID NO:242. 

By "isolated" in reference to nucleic acid is meant a polymer of nucleotides 
conjugated to each otherjncluding DNAjand .RNA,.that is isolated from a natural source 

10 or that is synthesized. The isolated nucleic acid of the present invention is unique in the 

sense that it is not found in a pure or separated state in nature. Use of the term "isolated" 
indicates that a naturally occurring sequence has been removed from its normal cellular 
(i.e., chromosomal) environment. Thus, the sequence may be in a cell-free solution or 
placed in a different cellular environment. The term does not imply that the sequence is 

15 the only nucleotide chain present, but that it is essentially free (about 90 - 95% pure at 

least) of non-nucleotide material naturally associated with it, and thus is distinguished 
from isolated chromosomes. 

By the use of the term "enriched" in reference to nucleic acid is meant that the 
specific DNA or RNA sequence constitutes a significantly higher fraction (2 - 5 fold) of 

20 the total DNA or RNA present in the cells or solution of interest than in normal or 

diseased cells or in the cells from which the sequence was taken. This could be caused by 
a person by preferential reduction in the amount of other DNA or RNA present, or by a 
preferential increase in the amount of the specific DNA or RNA sequence, or by a 
combination of the two. However, it should be noted that enriched does not imply that 

25 there are no other DNA or RNA sequences present, just that the relative amount of the 

sequence of interest has been significantly increased. The term "significant" is used to 
indicate that the level of increase is useful to the person making such an increase, and 
generally means an increase relative to other nucleic acids of about at least 2 fold, more 
preferably at least 5 to 10 fold or even more. The term also does not imply that there is no 

30 DNA or RNA from other sources. The other source DNA may, for example, comprise 
DNA from a yeast or bacterial genome, or a cloning vector such as pUC19. This term 
distinguishes from naturally occurring events, such as viral infection, or tumor type 
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growths, in which the level of one mRNA may be naturally increased relative to other 
species of mRNA. That is, the term is meant to cover only those situations in which a 
person has intervened to elevate the proportion of the desired nucleic acid. 

It is also advantageous for some purposes that a nucleotide sequence be in purified 
5 form. The term "purified" in reference to nucleic acid does not require absolute purity 

(such as a homogeneous preparation). Instead, it represents an indication that the 
sequence is relatively more pure than in the natural environment (compared to the natural 
level this level should be at least 2-5 fold greater, e.g., in terms of mg/mL). Individual 
clones isolated from a cDNA library may be purified to electrophoretic homogeneity. The 

10 claimed DNA molecules obtained from these clones could be obtained directly from total 

DNA or from total RNA. The cDNA clones are not naturally occurring, but rather are 
preferably obtained via manipulation of a partially purified naturally occurring substance 
(messenger RNA). The construction of a cDNA library from mRNA involves the creation 
of a synthetic substance (cDNA) and pure individual cDNA clones can be isolated from 

1 5 the synthetic library by clonal selection of the cells carrying the cDNA library. Thus, the 

process which includes the construction of a cDNA library from mRNA and isolation of 
distinct cDNA clones yields an approximately 10 6 -fold purification of the native message. 
Thus, purification of at least one order of magnitude, preferably two or three orders, and 
more preferably four or five orders of magnitude is expressly contemplated. 

20 By a "kinase polypeptide" is meant 10 (preferably 20, more preferably 40, most 

preferably 75) or more contiguous amino acids set forth in an amino acid sequence 
selected from the group consisting of those set forth in SEQ ID NO: 122, SEQ ID NO: 123, 
SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ED NO:127, SEQ ID N0.128, 
SEQ ID NO: 129, SEQ ID NO: 130, SEQ ED NO:131, SEQ ED NO: 132, SEQ ID NO: 133, 

25 SEQ ID NO: 1 34, SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: 1 38, 

SEQ ID NO:139, SEQ ED NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, 
SEQ ID NO:144, SEQ ED NO:145, SEQ ID NO:146, SEQ ED NO:147, SEQ ID NO:148, 
SEQ ED NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53, 
SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID NO: 1 58, 

30 SEQ ID NO: 159, SEQ ED NO: 160, SEQ ID NO: 161, SEQ ED NO: 162, SEQ ED NO: 163, 
SEQ ID NO: 1 64, SEQ ID NO: 1 65. SEQ ED NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, 
SEQ ED NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ED NO: 1 73, 
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SEQ ID NO: 1 74, SEQ ED NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, 
SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 181, SEQ ID NO: 1 82, SEQ ID NO: 1 83, 
SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, 
SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ ID NO: 1 99, SEQ ID NO: 1 93, 
5 SEQ ID NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, 
SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, 
SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, 
SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, 
SEQIDNO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, 

10 SEQ ID NO:2 1 9, SEQ ID NO:220, SEQ ID NO:22 1 , SEQ ID NO:222, SEQ ID NO:223, 
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, 
SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, 
SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, 
SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, or functional 

] 5 derivatives thereof as described herein. For sequences for which the full-length sequence 

is not given, the remaining sequences can be determined using methods well-known to 
those in the art and are intended to be included in the invention. In certain aspects, 
polypeptides of 100, 200, 300 or more amino acids are preferred. The kinase polypeptide 
can be encoded by a full-length nucleic acid sequence or any portion of the full-length 

20 nucleic acid sequence, so long as a functional activity of the polypeptide is retained. By 
"functional" domain is meant any region of the polypeptide that may play a regulatory or 
catalytic role as predicted from amino acid sequence homology to other proteins or by the 
presence of amino acid sequences that may give rise to specific structural conformations 
(i.e., coiled-coils). For some purposes, polypeptide domains are preferred, including, but 

25 not limited to, N-terminal, catalytic/kinase and C-terminal. 

The amino acid sequence will be substantially similar to a sequence selected from 
the group consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID 
NO: 1 24, SEQ ID NO: 1 25, SEQ ID NO: 1 26, SEQ ID NO: 1 27, SEQ ID NO: 1 28, SEQ ID 
NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID NO: 1 33, SEQ ID 

30 NO: 1 34, SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: 1 38, SEQ ID 
NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID 
NO: 144, SEQ ID NO: 145, SEQ ED NO: 146, SEQ ID NO: 147, SEQ ED NO: 148, SEQ ID 
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NO:149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID 
NO: 1 54, SEQ ED NO: 1 55, SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ED NO: 158, SEQ ED 
NO:159, SEQ ED NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID 
NO: 164, SEQ ED NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID 

5 NO: 1 69, SEQ ID NO: 1 70, SEQ ED NO: 1 71 , SEQ ED NO: 1 72, SEQ ED NO:l 73, SEQ ID 
NO: 174, SEQ ED NO: 175, SEQ ED NO: 176, SEQ ED NO: 177, SEQ ED NO: 178, SEQ ID 
NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 81 , SEQ ED NO: 1 82, SEQ ED NO: 1 83, SEQ ID 
NO: 184, SEQ ED NO: 185, SEQ ED NO: 186, SEQ ED NO: 187, SEQ ID NO:188, SEQ ED 
. -NO: l 89, SEQ ED NO: 1 90, SEQ ED NO: 1 91 , SEQ ED NO: 1 99, SEQ ED NO: 1 93, SEQ ED 

10 NO: 194, SEQ ID NO: 195, SEQ ED NO: 196, SEQ ED NO: 197, SEQ ID NO: 198, SEQ ID 

NO: 199, SEQ ED NO:200, SEQ ED NO:201, SEQ ED NO:202, SEQ ID NO:203, SEQ ID 
NO:204, SEQ ED NO:205, SEQ ED NO:206, SEQ ED NO:207, SEQ ID NO:208, SEQ ID 
NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID 
NO:214, SEQ ID NO:215, SEQ ED NO:216, SEQ ED NO:217, SEQ ID NO:218, SEQ ID 

15 NO:219, SEQ ED NO:220, SEQ ID NO:221, SEQ ED NO:222, SEQ ID NO:223, SEQ ID 

NO:224, SEQ ED NO:225, SEQ ID NO:226, SEQ ID N0.227, SEQ ID NO:228, SEQ ID 
NO:229, SEQ ED NO:230, SEQ ID NO:231 , SEQ ID NO:232, SEQ ID NO:233, SEQ ID 
NO:234, SEQ ED NO:235, SEQ ED NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID 
NO:239, SEQ ED NO:240, SEQ ED NO:241 , and SEQ ID NO:242, or the corresponding 

20 full-length amino acid sequence, or fragments thereof. A sequence that is substantially 
similar to a sequence selected from the group consisting of those set forth in SEQ ID 
NO: 1 22, SEQ ID NO: 1 23, SEQ ED NO: 1 24, SEQ ED NO: 1 25, SEQ ID NO: 1 26, SEQ ED 
NO: 127, SEQ ED NO: 128, SEQ ED NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID 
NO: 132, SEQ ED NO: 133, SEQ ED NO: 134, SEQ ED NO: 135, SEQ ID NO: 136, SEQ ID 

25 NO:137, SEQ ID NO:138, SEQ ED NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID 
NO: 1 42, SEQ ED NO: 1 43, SEQ ID NO: 1 44, SEQ ED NO: 1 45, SEQ ID NO: 1 46, SEQ ID 
NO: 147, SEQ ED NO: 148, SEQ ED NO: 149, SEQ ID NO: 150, SEQ ID NO:151, SEQ ID 
NO:152, SEQ ED NO: 153, SEQ ED NO: 154, SEQ ED NO: 155, SEQ ED NO:156, SEQ ID 
NO:l 57, SEQ ED NO: 1 58, SEQ ED NO: 1 59, SEQ ED NO: 1 60, SEQ ED NO: 1 6 1 , SEQ ID 

30 NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165. SEQ ID NO:166, SEQ ID 
NO: 1 67, SEQ ED NO: 1 68, SEQ ED NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 71 , SEQ ID 
NO: 172, SEQ ED NO: 173, SEQ ED NO: 174, SEQ ED NO: 175, SEQ ID NO: 176, SEQ ID 



WO 00/73469 



PCT/US00/14842 



9 



NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 181, SEQ ID 
NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID 
NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ ID 
NO: 1 99, SEQ ID NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ ID 
5 NO: 197, SEQ ED NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID 
NO:202, SEQ ED NO:203, SEQ ED NO:204, SEQ ID NO:205, SEQ ED NO:206, SEQ ID 
NO:207, SEQ ID NO:208, SEQ ED NO:209, SEQ ID NO:2 1 0, SEQ ED NO:2 1 1 , SEQ ID 
NO:212, SEQ ED NO:213, SEQ ED NO:214, SEQ ID NO:215, SEQ ED NO:216, SEQ ID 
NO:2 1 7, SEQ ED NO:2 1 8, SEQ ID NO:2 1 9, SEQ ID NO:220, SEQ ID NO:22 1 , SEQ ID 

1 0 NO:222, SEQ ED NO:223, SEQ ED NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231 , SEQ ID 
NO:232, SEQ ED NO:233, SEQ ID NO:234, SEQ ED NO:235, SEQ ID NO:236, SEQ ID 
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ 
ED NO:242 will have at least 75% identity (preferably 90%, more preferably at least 95% 

1 5 and most preferably 99-1 00%) to a sequence selected from the group consisting of those 

set forth in SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ 
ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ED NO: 130, SEQ 
ID NO:131, SEQ ID NO:132, SEQ ED NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ 
ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ED NO: 140, SEQ 

20 ID NO: 1 4 1 , SEQ ED NO: 1 42, SEQ ED NO: 1 43, SEQ ED NO: 1 44, SEQ ED NO: 1 45, SEQ 

ID NO: 1 46, SEQ ED NO: 1 47, SEQ ED NO: 1 48, SEQ ED NO: 1 49, SEQ ED NO: 150, SEQ 
ID NO: 1 5 1 , SEQ ED NO: 1 52, SEQ ED NO: 1 53, SEQ ED NO: 1 54, SEQ ID NO: 1 55, SEQ 
ID NO: 1 56, SEQ ED NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ED NO: 1 60, SEQ 
ID NO: 1 61 , SEQ ED NO: 1 62, SEQ ED NO: 1 63, SEQ ED NO: 1 64, SEQ ID NO: 1 65. SEQ 

25 ID NO: 166, SEQ ED NO: 167, SEQ ID NO: 168, SEQ ED NO: 169, SEQ ID NO: 170, SEQ 

ID NO: 1 71 , SEQ ID NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ 
ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ED NO: 1 79, SEQ ID NO: 1 80, SEQ 
ID NO: 181, SEQ ID NO: 182, SEQ ID NO: 183, SEQ ED NO: 184, SEQ ID NO: 185, SEQ 
ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ 

30 ID NO: 1 91 , SEQ ID NO: 1 99, SEQ ID NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, SEQ 
ID NO: 1 96, SEQ ED NO: 1 97, SEQ ED NO: 1 98, SEQ ID NO: 1 99, SEQ ID NO:200, SEQ 
ED NO:201, SEQ ED NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ 
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ID NO:206, SEQ ED NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ 
ID N0:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ 
ID NO:2I6, SEQ ID NO:217, SEQ ID N0:21S, SEQ ID NO:219, SEQ ID NO:220, SEQ 
ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ 
ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ 
ED NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ED NO:234, SEQ ID NO:235, SEQ 
ED NO:236, SEQ ID NO:237, SEQ ED NO:238, SEQ ID NO:239, SEQ ED NO:240, SEQ 
ID NO:241, and SEQ ID NO:242 or portions of or the entire corresponding full-length 
amino acid sequences. 

By "identity" is meant a property of sequences that measures their similarity or 
relationship. Identity is measured by dividing the number of identical residues between 
two sequences (either full-length or a defined domain) by the total number of residues in 
the known sequence, or the domain of the known sequence, and multiplying the product 
by 100. Thus, two copies of exactly the same sequence have 100% identity, but sequences 
that are less highly conserved, and have replacements and substitutions, have a lower 
degree of identity. "Gaps" are spaces in an alignment that can result from aligning a novel 
sequence with a known sequence when the novel sequence has additions or deletions of 
amino acids in comparison with the known sequence. These gaps do not factor into the 
assessment of % identity using the sbove calculation. 

Those skilled in the art will recognize that several computer programs are also 
available for determining sequence identity using standard parameters, for example, Blast 
(Altschul, et al (1997) Nucleic Acids Res. 25:3389-3402), Blast2 (Altschul, et al (1990) 
J. Mol. Biol. 215:403-410), and Smith-Waterman (Smith, et al. (1981) J. Mol. Biol. 
147:195-197). 

In preferred embodiments, the invention features isolated, enriched, or purified 
nucleic acid molecules encoding a kinase polypeptide comprising a nucleotide sequence 
that: (a) encodes a polypeptide having an amino acid sequence selected from the group 
consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ 
ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ 
ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ED NO:134, SEQ 
ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ED NO:138, SEQ ID NO:139, SEQ 
ID NO: 1 40, SEQ ID NO: 1 41 , SEQ ID NO: 1 42, SEQ ED NO: 1 43, SEQ ID NO: 1 44, SEQ 
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ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ED NO: 149, SEQ 
ID NO: 150, SEQ ED NO: 151, SEQ DD NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ 
ID NO: 1 55, SEQ DD NO: 1 56, SEQ DD NO: 1 57, SEQ ED NO: 1 58, SEQ ID NO: 1 59, SEQ 
DD NO: 160, SEQ DD NO: 161, SEQ DD NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ 
5 DD NO: 165. SEQ DD NO: 166, SEQ DD NO: 167, SEQ ID NO: 168, SEQ DD NO: 1 69, SEQ 
DD NO: 1 70, SEQ DD NO: 1 71 , SEQ DD NO: 1 72, SEQ ID NO: 1 73, SEQ DD NO: 1 74, SEQ 
DD NO: 1 75, SEQ DD NO: 1 76, SEQ DD NO: 1 77, SEQ DD NO: 1 78, SEQ DD NO: 1 79, SEQ 
DD NO: 1 80, SEQ DD NO: 1 8 1 , SEQ DD NO: 1 82, SEQ DD NO: 1 83, SEQ DD NO: 1 84, SEQ 
ID NO: 1 85, SEQ DD NO: 1 86, SEQ DD NO: 1 87, SEQ DD NO: 1 88, SEQ DD NO: 1 89, SEQ 

1 0 ID NO: 1 90, SEQ DD NO: 1 91 , SEQ DD NO: 1 99, SEQ DD NO: 1 93, SEQ DD NO: 1 94, SEQ 

DD NO: 1 95, SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, SEQ 
ID NO:200, SEQ DD NO:201 , SEQ DD NO:202, SEQ DD NO:203, SEQ ID NO:204, SEQ 
ID NO:205, SEQ DD NO:206, SEQ DD NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ 
DDNO:210, SEQ DD NO:21 1, SEQ DD NO:212, SEQ IDNO:213, SEQ ID NO:214, SEQ 

15 DDNO:215, SEQ DD NO:216, SEQ DD NO:217, SEQ ID NO:218, SEQ DD NO:219, SEQ 
ID NO:220, SEQ DD NO:221, SEQ DD NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ 
ID NO:225, SEQ ID NO:226, SEQ DD NO:227, SEQ DD NO:228, SEQ ID NO:229, SEQ 
DD NO:230, SEQ DD NO:231 , SEQ DD NO:232, SEQ DD NO:233, SEQ DD NO:234, SEQ 
ID NO:235, SEQ DD NO:236, SEQ DD NO:237, SEQ DD NO:238, SEQ DD NO:239, SEQ 

20 DD NO:240, SEQ DD NO:241 , and SEQ DD NO:242, or the corresponding full-length 

amino acid sequence, or fragments thereof. A sequence that is substantially similar to a 
sequence selected from the group consisting of those set forth in SEQ DD NO: 122, SEQ ID 
NO:123, SEQ ID NO:124, SEQ DD NO:125, SEQ DD NO:126, SEQ DD NO:127, SEQ ID 
NO: 1 28, SEQ DD NO: 129, SEQ DD NO: 1 30, SEQ DD NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID 

25 NO: 1 33, SEQ DD NO: 1 34, SEQ DD NO: 1 35, SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID 
NO: 138, SEQ DD NO: 139, SEQ DD NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID 
NO:143, SEQ DD NO:144, SEQ DD NO:145, SEQ DD NO:146, SEQ DD NO:147, SEQ DD 
NO: 1 48, SEQ DD NO: 1 49, SEQ DD NO: 1 50, SEQ DD NO: 1 5 1 , SEQ DD NO: 1 52, SEQ ID 
NO: 153, SEQ DD NO: 154, SEQ DD NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID 

30 NO: 158, SEQ DD NO: 159, SEQ ID NO: 160, SEQ DD NO: 161, SEQ DD NO: 162, SEQ ID 
NO: 163, SEQ DD NO: 164, SEQ DD NO: 165. SEQ DD NO: 166, SEQ ID NO: 167, SEQ DD 
NO: 1 68, SEQ DD NO: 1 69, SEQ DD NO: 1 70, SEQ DD NO: 1 71 , SEQ DD NO: 1 72, SEQ ID 
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NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID 
NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 182, SEQ ID 
NO:183, SEQ ID NO: 184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID 
NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID 
5 NO:193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID 
NO:198, SEQ IDNO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID 
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO.206, SEQ ID NO:207, SEQ ID 
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID 
NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID 

10 NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID 
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ED NO:227, SEQ ID 
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID 
NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID N0.236, SEQ ID NO:237, SEQ ID 
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242 will 

] 5 have at least 75% identity (preferably 90%, more preferably at least 95% and most 

preferably 99-100%) to the sequence selected from the group consisting of those set forth 
in SEQ ID NO: 1 22, SEQ ID NO: 1 23, SEQ ID NO: 1 24, SEQ ID NO: 125, SEQ ID 
NO: 1 26, SEQ ID NO: 1 27, SEQ ID NO: 1 28, SEQ ID NO: 1 29, SEQ ID NO: 1 30, SEQ ID 
NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID 

20 NO: 136, SEQ ID NO: 1 37, SEQ ID NO: 1 38, SEQ ID NO: 1 39, SEQ ID NO: 1 40, SEQ ID 
NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID 
NO: 1 46, SEQ ID NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 150, SEQ ID 
NO:l 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID 
NO: 1 56, SEQ ID NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID 

25 NO: 1 61 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65. SEQ ID 
NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID 
NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID 
NO: 1 76, SEQ ID NO: 1 77, SEQ ED NO: 1 78, SEQ ED NO: 1 79, SEQ ED NO: 1 80, SEQ ID 
NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID 

30 NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID 
NO: 191, SEQ ED NO: 199, SEQ ED NO: 193, SEQ ID NO:194, SEQ ID NO: 195, SEQ ID 
NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, SEQ ID NO:200, SEQ ID 
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NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID 
NO:206, SEQ ID NO:207, SEQ ID NO:208 5 SEQ ID NO:209, SEQ ID NO:210, SEQ ID 
N0:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID 
N0:2 1 6, SEQ ID N0:2 1 7, SEQ ID N0:2 1 8, SEQ ID N0:2 1 9, SEQ ID NO:220, SEQ ID 
5 NO:22 1 , SEQ ED NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ED NO:225, SEQ ED 
NO:226, SEQ ED NO:227, SEQ ID NO:228, SEQ ED NO:229, SEQ ED NO:230, SEQ ID 
NO:231, SEQ ED NO:232, SEQ ED NO:233, SEQ ID NO:234, SEQ ED NO:235, SEQ ID 
NO:236, SEQ ED NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID 
NO:241, and SEQ ED NO:242; (b) is the complement of the nucleotide sequence of (a); (c) 

1 0 hybridizes under highly stringent conditions to the nucleotide molecule of (a) and encodes 

a naturally occurring kinase polypeptide; (d) encodes a kinase polypeptide having an 
amino acid sequence selected from the group consisting of those set forth in SEQ ID 
NO:122, SEQ ED NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID 
NO:127, SEQ ED NO:128, SEQ ID NO:129, SEQ ED NO.130, SEQ ID NO:131, SEQ ED 

15 NO:132, SEQ ID NO: 133, SEQ ED NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID 

NO: 137, SEQ ED NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 3 41, SEQ ID 
NO: 142, SEQ ED NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ED NO: 146, SEQ ED 
NO:147, SEQ ID NO:148, SEQ ED NO:149, SEQ ID NO:150, SEQ ED NO:151, SEQ ID 
NO:152, SEQ IDNO:153, SEQ ID NO: 154, SEQ EDNO:155, SEQ ID NO:156, SEQ ID 

20 NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ED NO: 161, SEQ ID 
NO: 1 62, SEQ ID NO: 1 63, SEQ ED NO: 1 64, SEQ ID NO: 1 65. SEQ ID NO: 1 66, SEQ ID 
NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID 
NO: 172, SEQ ID NO: 173, SEQ ED NO: 174, SEQ ID NO: 175, SEQ ED NO: 176, SEQ ID 
NO: 177, SEQ ID NO: 178, SEQ ED NO: 179, SEQ ID NO: 180, SEQ ID NO: 181, SEQ ID 

25 NO:182, SEQ ID NO: 183, SEQ ED NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID 
NO.i 87, SEQ ED NO: 1 88, SEQ ED NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ ID 
NO: 199, SEQ ED NO: 193, SEQ ED NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID 
NO: 197, SEQ ED NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID 
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 

30 NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:21 0, SEQ ID NO:21 1, SEQ ID 
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID 
NO:217, SEQ ED NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
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NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:23 1 , SEQ ID 
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ 
5 ID NO:242, or the corresponding full-length amino acid sequence, or fragments thereof. 

A sequence that is substantially similar to a sequence selected from the group consisting of 
those set forth in SEQ ID NO: 122, SEQ ID NO:123, SEQ ID NO: 124, SEQ ID NO: 125, 
SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, 
SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, 

10 SEQ ID NO:136, SEQ ED NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, 
SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, 
SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, 
SEQ ID NO:151, SEQ ED NO:152, SEQ ID NO:l 53, SEQ ID NO:154, SEQ ID NO:l 55, 
SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID NO:l 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, 

15 SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165. 

SEQIDNO:166, SEQIDNO:167, SEQIDNO:168, SEQEDNO:169, SEQIDNO:170, 
SEQ ID NO: 171, SEQ ID NO:172, SEQ ID NO:l 73, SEQ ID NO:l 74, SEQ ID NO:175, 
SEQ ED NO: 176, SEQ ID NO: 177, SEQ ID NO: 178, SEQ ID NO: 179, SEQ ID NO: 180, 
SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, 

20 SEQ ID NO:l 86, SEQ ID NO:l 87, SEQ ID NO.l 88, SEQ ID NO:l 89, SEQ ID NO:l 90, 
SEQ ED NO: 1 9 1 , SEQ ED NO: 1 99, SEQ ED NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, 
SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, 
SEQ ID NO:201, SEQ ID NO.202, SEQ ED NO:203, SEQ ED NO:204, SEQ ID NO:205, 
SEQ ED NO:206, SEQ ED NO:207, SEQ ED NO:208, SEQ ID NO:209, SEQ ID NO:210, 

25 SEQ ID NO:2 1 1 , SEQ ED NO:2 1 2, SEQ ID NO:2 1 3, SEQ ID NO:2 1 4, SEQ ID NO:2 1 5, 

SEQ ID NO:2 1 6, SEQ ED NO:2 1 7, SEQ ID NO:2 1 8, SEQ ID NO:2 1 9, SEQ ID NO:220, 
SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, 
SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, 
SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ED NO:234, SEQ ID NO:235, 

30 SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, 
SEQ ID NO:241, and SEQ ED NO:242 will have at least 75% identity (preferably 90%, 
more preferably at least 95% and most preferably 99-100%) to the sequence of SEQ ID 
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NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID 
NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID 
NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID 
NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID 
5 NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID 

NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID 
NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID 
NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID 
NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65 . SEQ ID NO: 1 66, SEQ ID 

10 NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, SEQ ID 
NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID 
NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ED NO: 1 8 1 , SEQ ID 
NO: 182, SEQ ID NO: 183, SEQ ED NO: 184, SEQ ED NO: 185, SEQ ED NO: 186, SEQ ID 
NO: 187, SEQ ED NO: 188, SEQ ED NO: 189, SEQ ED NO: 190, SEQ ED NO: 191, SEQ ID 

15 NO: 199, SEQ ED NO: 193, SEQ ED NO: 194, SEQ ED NO: 195, SEQ ID NO: 196, SEQ ID 

NO:197, SEQ ID NO:198, SEQ ED NO:199, SEQ ED NO:200, SEQ ID NO:201, SEQ ID 
NO:202, SEQ ED NO:203, SEQ ID NO:204, SEQ ED NO:205, SEQ ID NO:206, SEQ ID 
NO:207, SEQ ED NO:208, SEQ ID NO:209, SEQ ID NO:2 1 0, SEQ ID NO:2 1 1 , SEQ ID 
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID 

20 NO:2 1 7, SEQ ID NO:2 1 8, SEQ ED NO:2 1 9, SEQ ID NO:220, SEQ ID NO:22 1 , SEQ ID 
NO:222, SEQ ED NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ID NO:228, SEQ ED NO:229, SEQ ED NO:230, SEQ ID NO:231, SEQ ID 
NO:232, SEQ ED NO:233, SEQ ED NO:234, SEQ ED NO:235, SEQ ID NO:236, SEQ ID 
NO:237, SEQ ED NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ 

25 ED NO:242, except that it lacks one or more, but not all, of a domain selected from the 
group consisting of an N-terminal domain, a catalytic domain, a C-terminal domain, a 
coiled-coil structure region, a proline-rich region, a spacer region, an insert, and a C- 
lerminal tail; (e) is the complement of the nucleotide sequence of (d); (f) encodes a 
polypeptide having an amino acid sequence selected from the group consisting of those set 

30 forth in SEQ ID NO: 1 22, SEQ ID NO: 1 23, SEQ ID NO: 1 24, SEQ ID NO: 1 25, SEQ ID 
NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID 
NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID 
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NO: 136, SEQ ID NO: 137, SEQ ID NO: 1 38, SEQ ID NO: 1 39, SEQ ID NO: 1 40, SEQ ID 
NO:141, SEQ ID NO: 142, SEQ ED NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID 
NO: 1 46, SEQ ED NO:l 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID 
NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID 
5 NO: 156, SEQ ED NO: 157, SEQ ED NO: 158, SEQ ED NO: 159, SEQ ED NO: 160, SEQ ID 
NO: 1 6 1 , SEQ ED NO: 1 62, SEQ ID NO: 1 63 , SEQ ID NO: 1 64, SEQ ED NO: 1 65 . SEQ ID 
NO: 166, SEQ ED NO: 167, SEQ ED NO: 168, SEQ ID NO: 169, SEQ ED NO: 170, SEQ ID 
NO: 1 7 1 , SEQ ED NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, SEQ ED NO: 1 75, SEQ ID 
NO: 1 76, SEQ ED NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID 

10 NO: 1 8 1 , SEQ ED NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID 
NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ED NO: 1 89, SEQ ID NO: 1 90, SEQ ID 
NO: 1 9 1 , SEQ ID NO: 1 99, SEQ ID NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, SEQ ID 
NO: 1 96, SEQ ED NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, SEQ ID NO:200, SEQ ID 
NO:201, SEQ ID NO:202, SEQ ED NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID 

1 5 NO:206, SEQ ID NO:207, SEQ ED NO:208, SEQ ID NO:209, SEQ ID NO:2 1 0, SEQ ID 
NO:21 1, SEQ ED NO:212, SEQ ED NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID 
NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ED NO:220, SEQ ID 
NO:221, SEQ ED NO:222, SEQ ED NO:223, SEQ ED NO:224, SEQ ED NO:225, SEQ ID 
NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID 

20 NO:23 1 , SEQ ED NO:232, SEQ ED NO:233, SEQ ED NO:234, SEQ ED NO:235, SEQ ID 

NO:236, SEQ ID NO:237, SEQ ED NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID 
NO:241, and SEQ ED NO:242, or the corresponding full-length amino acid sequence, or 
fragments thereof. (The domain demarcations of the polypeptides of the invention are 
indicated in Table 2 by reference to the kinase domain.) A sequence that is substantially 

25 similar to a sequence selected from the group consisting of those set forth in SEQ ID 

NO: 1 22, SEQ ID NO: 1 23, SEQ ED NO: 1 24, SEQ ID NO: 1 25, SEQ ID NO: 126, SEQ ID 
NO: 127, SEQ ID NO: 1 28, SEQ ID NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID 
NO: 1 32, SEQ ED NO: 1 33, SEQ ID NO: 1 34, SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ ID 
NO:137, SEQ ID NO:138, SEQ ED NO:139, SEQ EDNO:140, SEQ ID NO:141, SEQ ID 

30 NO: 1 42, SEQ ID NO: 1 43 , SEQ ED NO: 1 44, SEQ ED NO: 1 45, SEQ ID NO: 1 46, SEQ ED 
NO: 1 47, SEQ ED NO: 1 48, SEQ ED NO: 1 49, SEQ ED NO: 1 50, SEQ ED NO: 1 5 1 , SEQ ED 
NO:152, SEQ ED NO: 153, SEQ ED NO: 154, SEQ ID NO: 155, SEQ ED NO: 156, SEQ ID 
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NO: 157, SEQ ED NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID 
NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID 
NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID NO:171, SEQ ID 
NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID 
5 NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID 
NO:l 82, SEQ ID NO:l 83, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:l 86, SEQ ID 
NO: 187, SEQ ID NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID 
NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID 
NO: 1 97, SEQ ED NO: 1 98, SEQ ED NO: 1 99, SEQ ED NO:200, SEQ ED NO:201 , SEQ ID 

10 NO:202, SEQ ED NO:203, SEQ ED NO:204, SEQ ED NO:205, SEQ ED NO:206, SEQ ID 
NO:207, SEQ ED NO:208, SEQ ID NO:209, SEQ ED NO:2 1 0, SEQ ED NO:2 1 1 , SEQ ID 
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ED 
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ 1DNO:220, SEQ ID NO:221, SEQ ID 
NO:222, SEQ ED NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 

1 5 NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:23 1 , SEQ ID 
NO:232, SEQ ED NO:233, SEQ ED NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 
NO:237, SEQ ED NO:238, SEQ ID N0.239, SEQ ID NO:240, SEQ ID NO:241, and SEQ 
ID NO:242 will have at least 75% identity (preferably 90%, more preferably at least 95% 
and most preferably 99-100%) to the sequence selected from the group consisting of those 

20 set forth in SEQ ED NO: 1 22, SEQ ID NO: 1 23, SEQ ED NO: 1 24, SEQ ID NO: 1 25, SEQ 
EDNO:I26, SEQ ID NO:127, SEQ ED NO:128, SEQ ED NO:129, SEQ ID NO:130, SEQ 
ED NO: 1 3 1 , SEQ ED NO: 1 32, SEQ ED NO: 1 33, SEQ ED NO: 1 34, SEQ ID NO: 1 35, SEQ 
ED NO: 136, SEQ ED NO: 137, SEQ ED NO: 138, SEQ ED NO: 139, SEQ ED NO: 140, SEQ 
ED NO: 1 4 1 , SEQ ED NO: 1 42, SEQ ED NO: 1 43, SEQ ED NO: 1 44, SEQ ID NO: 1 45, SEQ 

25 ID NO: 146, SEQ ED NO: 147, SEQ ED NO: 148, SEQ ED NO: 149, SEQ ID NO: 150, SEQ 

IDNO:151, SEQ ED NO:152, SEQ ED NO:153, SEQ ED NO:154, SEQ IDNO:155, SEQ 
ED NO: 1 56, SEQ ID NO: 1 57, SEQ ED NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ 
ED NO: 1 6 1 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65. SEQ 
ID NO:l 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ED NO: 1 70, SEQ 

30 1DN0:171, SEQ ED NO:172, SEQ ID NO: 173, SEQ ED NO:174, SEQ ID NO:175, SEQ 
ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ 
ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ 
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ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ 
ID NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ 
ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ 
ID NO:201 , SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ 

5 ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:2 1 0, SEQ 

1DN0:21 1, SEQ IDNO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ 
IDNO:2l6, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ 
ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ 
ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ 

10 ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ 
ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ 
ID NO:241, and SEQ ED NO:242; (b) is the complement of the nucleotide sequence of (a); 
(c) hybridizes under highly stringent conditions to the nucleotide molecule of (a) and 
encodes a naturally occurring kinase polypeptide; (d) encodes a kinase polypeptide having 

1 5 an amino acid sequence selected from the group consisting of those set forth in SEQ ID 

NO:122, SEQ ID NO:123, SEQ ED NO:124, SEQ ID NO:125, SEQ ED NO:126, SEQ ID 
NO: 127, SEQ ED NO: 128, SEQ ED NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID 
NO: 1 32, SEQ ID NO:l 33, SEQ ID NO: 1 34, SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ ID 
NO: 1 37, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 1 40, SEQ ID NO: 1 41 , SEQ ID 

20 NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID 

NO: 1 47, SEQ ID NO: 1 48, SEQ ED NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID 
NO: 1 52, SEQ ED NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID 
NO: 157, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 61 , SEQ ID 
NO: 162, SEQ ED NO: 163, SEQ ED NO: 164, SEQ ID NO: 165. SEQ ED NO: 166, SEQ ID 

25 NO: 1 67, SEQ ED NO: 1 68, SEQ ED NO: 1 69, SEQ ID NO: 1 70, SEQ ED NO: 1 7 1 , SEQ ID 
NO: 1 72, SEQ ED NO: 1 73, SEQ ED NO: 1 74, SEQ ED NO: 1 75, SEQ ED NO: 1 76, SEQ ID 
NO: 177, SEQ ED NO: 178, SEQ ED NO:179, SEQ ED NO:180, SEQ ED NO: 181, SEQ ID 
NO: 1 82, SEQ ED NO: 1 83, SEQ ID NO: 1 84, SEQ ED NO: 1 85, SEQ ED NO: 1 86, SEQ ID 
NO: 1 87, SEQ ED NO:l 88, SEQ ED NO: 1 89, SEQ ED NO: 1 90, SEQ ID NO: 1 91 , SEQ ID 

30 NO: 199, SEQ ID NO: 193, SEQ ED NO: 194, SEQ ED NO: 195, . SEQ ID NO: 196, SEQ ID 
NO:197, SEQ ID NO:198, SEQ ED NO:199, SEQ ED NO:200, SEQ ID NO:201, SEQ ID 
NO:202, SEQ ED NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 
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NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID N0:21 1, SEQ ID 
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID 
NO:217, SEQ IDNO:218, SEQ IDNO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 

5 NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID 
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ED NO:241, and SEQ 
ID NO:242, or the corresponding full-length amino acid sequence, or fragments thereof. 
A sequence that is substantially similar to a sequence selected from the group consisting of 

10 those set forth in SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, 
SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, 
SEQ ID NO: 131, SEQ ID NO: 132, SEQ ED NO: 133, SEQ ED NO: 134, SEQ ID NO: 135, 
SEQ ID NO: 136, SEQ ED NO: 137, SEQ ED NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, 
SEQ ED N0.141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, 

15 SEQ ED NO: 146, SEQ ID NO:147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, 

SEQ ED NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ED NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, 
SEQ ED NO: 156, SEQ ED NO: 157, SEQ ED NO: 158, SEQ ED NO: 159, SEQ ED NO: 160, 
SEQ ED NO: 1 61 , SEQ ED NO: 1 62, SEQ ID NO: 1 63, SEQ ED NO: 1 64, SEQ ED NO: 1 65. 
SEQ ED NO: 1 66, SEQ ED NO: 1 67, SEQ ED NO: 1 68, SEQ ED NO: 1 69, SEQ ED NO: 1 70, 

20 SEQ ED NO: 171, SEQ ED NO: 172, SEQ ED NO: 173, SEQ ED NO: 174, SEQ ED NO: 175, 
SEQ ED NO: 1 76, SEQ ID NO: 1 77, SEQ ED NO: 1 78, SEQ ED NO: 1 79, SEQ ID NO: 1 80, 
SEQ ED NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ED NO: 1 83, SEQ ID NO: 1 84, SEQ ED NO: 1 85, 
SEQ ED NO:l 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, 
SEQ ED NO: 191, SEQ ED NO: 199, SEQ ID NO: 193, SEQ ED NO: 194, SEQ ID NO: 195, 

25 SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ED NO: 1 98, SEQ ED NO: 1 99, SEQ ID NO:200, 
SEQ ED NO:201, SEQ ED NO:202, SEQ ID NO:203, SEQ ED NO:204, SEQ ED NO:205, 
SEQ ED NO:206, SEQ ED NO:207, SEQ ED NO:208, SEQ ID NO:209, SEQ ID NO:210, 
SEQ ED NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:21 5, 
SEQ ID NO:216, SEQ ED NO:217, SEQ ED NO:218, SEQ ED NO:219, SEQ ED NO:220, 

30 SEQ ID NO:221, SEQ ED NO:222, SEQ ED NO:223, SEQ ED NO:224, SEQ ED NO:225, 
SEQ ID NO:226, SEQ ID NO:227, SEQ ED NO:228, SEQ ED NO:229, SEQ ED NO:230, 
SEQ ID NO:231, SEQ ED NO:232, SEQ ED NO:233, SEQ ED NO:234, SEQ ED NO:235, 
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SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, 
SEQ ID NO:241 , and SEQ ID NO:242 will have at least 75% identity (preferably 90%, 
more preferably at least 95% and most preferably 99-100%) to a domain of a polypeptide 
selected from the group consisting of those set forth in SEQ ID NO: 122, SEQ ID NO:123, 
5 SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, 

SEQ ID NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID NO: 1 33, 
SEQ ID NO: 1 34, SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: 1 38, 
SEQ ID NO: 1 39, SEQ ID NO: 1 40, SEQ ID NO: 1 4 1 , SEQ ID NO: 1 42, SEQ ID NO: 1 43, 
SEQ ID NO: 1 44, SEQ ID NO: 1 45, SEQ ID NO: 1 46, SEQ ID NO: 1 47, SEQ ID NO: 1 48, 

10 SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53 , 
SEQ ID NO: 1 54, SEQ ID NO: 155, SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID NO: 1 58, 
SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, 
SEQ ID NO: 1 64, SEQ ID NO: 1 65 . SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, 
SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ID NO: 1 73, 

15 SEQ ID NO:l 74, SEQ ID NO: 1 75, SEQ ID NO:l 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, 

SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, 
SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, 
SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ ID NO: 1 99, SEQ ID NO: 1 93, 
SEQ ID NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, 

20 SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, 
SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, 
SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, 
SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, 
SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, 

25 SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, 
SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231 , SEQ ID NO:232, SEQ ED NO:233, 
SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, 
SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, where the 
domain is selected from the group consisting of an N-terminal domain, a catalytic domain, 

30 a C-terminal domain, a coiled-coil structure region, a proline-rich region, a spacer region, 
an insert, and a C-terminal tail; (g) is the complement of the nucleotide sequence of (f); (h) 
encodes a polypeptide having an amino acid sequence selected from the group consisting 
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of those set forth in SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID 

NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID 

NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID NO: 1 33, SEQ ED NO: 1 34, SEQ ID 

NO: 1 35, SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: 1 38, SEQ ID NO: 1 39, SEQ ID 
NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID 
NO: 1 45, SEQ ID NO: 1 46, SEQ ID NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID 
NO: 150, SEQ ID NO:151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID 
NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO. 158, SEQ ID NO: 159, SEQ ID 
NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID 
NO: 1 65. SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID 
NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID 
NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID 
NO: 1 80, SEQ ED NO: 1 81 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID 
NO: 1 85, SEQ ED NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID 
NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ ED NO: 194, SEQ ID 
NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID 
NO.200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID 
NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID 
NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ED N0.213, SEQ ID NO:214, SEQ ID 
NO:2 1 5, SEQ ID NO:2 1 6, SEQ ID NO:2 1 7, SEQ ID NO:2 1 8, SEQ ID NO:2 1 9, SEQ ID 
NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID 
NO:225, SEQ ID NO:226, SEQ LD NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID 
NO:230, SEQ ID NO:231 , SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID 
NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID 
NO:240, SEQ ID NO:241, and SEQ ID NO:242, or the corresponding full-length amino 
acid sequence, or fragments thereof. A sequence that is substantially similar to a sequence 
selected from the group consisting of those set forth in SEQ ED NO: 122, SEQ ID NO: 123, 
SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, 
SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, 
SEQ ID NO:134, SEQ ED NO:135, SEQ DD NO:136, SEQ ID NO:137, SEQ ID NO:138, 
SEQ ED NO:139, SEQ ED NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, 
SEQ DD NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, 
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SEQ ID NO: 149, SEQ ED NO: 150, SEQ ED NO: 151, SEQ ID NO: 152, SEQ ED NO: 153, 
SEQ ED NO:l 54, SEQ ED NO:l 55, SEQ ID NO: 1 56, SEQ ED NO: 1 57, SEQ ID NO: 1 58, 
SEQ ED NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, 
SEQ ED NO: 1 64, SEQ ED NO: 1 65 . SEQ ED NO: 1 66, SEQ ED NO: 1 67, SEQ ED NO: 1 68, 

5 SEQ ED NO: 169, SEQ ED NO: 170, SEQ ID NO: 171, SEQ ED NO: 172, SEQ ED NO: 173, 
SEQ ED NO: 1 74, SEQ ED NO:l 75, SEQ ED NO: 1 76, SEQ ED NO: 1 77, SEQ ED NO: 1 78, 
SEQ ED NO: 1 79, SEQ ED NO: 1 80, SEQ ED NO: 1 81 , SEQ ED NO: 1 82, SEQ ED NO: 1 83, 
SEQ ID NO: 1 84, SEQ ED NO:l 85, SEQ ED NO: 1 86, SEQ ED NO: 1 87, SEQ ED NO: 1 88, 
SEQ ED NO:189, SEQ EDNO:190, SEQ ED NO:191, SEQ ED NO:199, SEQ ID NO:193, 

10 SEQ ED NO: 1 94, SEQ ED NO: 1 95 , SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, 
SEQ ED NO: 199, SEQ ED NO:200, SEQ ED NO:201, SEQ ID NO:202, SEQ ID NO:203, 
SEQ ED NO:204, SEQ ED NO:205, SEQ ED NO:206, SEQ ID NO:207, SEQ ID NO:208, 
SEQ ED NO:209, SEQ ED NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, 
SEQ ED NO:214, SEQ EDNO:215, SEQ ED NO:216, SEQ ID NO:217, SEQ ID NO:218, 

15 SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, 
SEQ ED NO:224, SEQ ED NO:225, SEQ ED NO:226, SEQ ID NO:227, SEQ ID NO:228, 
SEQ ED NO:229, SEQ ED NO:230, SEQ ED NO:231, SEQ ED NO:232, SEQ ID NO:233, 
SEQ ED NO:234, SEQ ED NO:235, SEQ ED NO:236, SEQ ED NO:237, SEQ ED NO:238, 
SEQ ED NO:239, SEQ ED NO:240, SEQ ED NO:241, and SEQ ID NO:242 will have at 

20 least 75% identity (preferably 90%, more preferably at least 95% and most preferably 99- 

100%) to the sequence selected from the group consisting of those set forth in SEQ ED 
NO: 122, SEQ ED NO: 123, SEQ ED NO: 124, SEQ ID NO: 125, SEQ ED NO: 126, SEQ ID 
NO: 127, SEQ ID NO: 128, SEQ ED NO: 129, SEQ ED NO: 130, SEQ ID NO: 131, SEQ ID 
NO:132, SEQ ED NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ED NO:136, SEQ ED 

25 NO: 137, SEQ ID NO: 1 38, SEQ ID NO: 1 39, SEQ ID NO: 1 40, SEQ ID NO: 1 4 1 , SEQ ID 

NO: 142, SEQ ED NO: 143, SEQ ED NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID 
NO: 147, SEQ ID NO: 148, SEQ ED NO: 149, SEQ ID NO:l 50, SEQ ED NO: 151, SEQ ID 
NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 156, SEQ ID 
NO: 1 57, SEQ ED NO: 1 58, SEQ ID NO: 1 59, SEQ ED NO: 1 60, SEQ ID NO: 1 6 1 , SEQ ID 

30 NO: 1 62, SEQ ED NO: 1 63, SEQ ED NO: 1 64, SEQ ID NO: 1 65. SEQ ED NO: 1 66, SEQ ID 
NO: 1 67, SEQ ED NO: 1 68, SEQ ED NO: 1 69, SEQ ED NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ED 
NO: 1 72, SEQ ED NO: 1 73, SEQ ED NO: 1 74, SEQ ED NO: 1 75, SEQ ID NO: 1 76, SEQ ID 
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NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID N0:181, SEQ ID 
NO: 182, SEQ ED NO: 183, SEQ ID NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID 
NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ ID 
NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID 

5 NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID 
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:2 1 0, SEQ ID NO:2 1 1 , SEQ ID 
N0.212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID 
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 

10 NO:222, SEQ ID NO:223, SEQ ED NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ID NO:228, SEQ IDNO:229, SEQ ED NO:230, SEQ ED NO:231, SEQ ID 
NO:232, SEQ ID NO:233, SEQ ED NO:234, SEQ ED NO:235, SEQ ED NO:236, SEQ ED 
NO:237, SEQ ED NO:238, SEQ ED NO:239, SEQ ED NO:240, SEQ ED NO:241 , and SEQ 
ID NO:242; (b) is the complement of the nucleotide sequence of (a); (c) hybridizes under 

1 5 highly stringent conditions to the nucleotide molecule of (a) and encodes a naturally 
occurring kinase polypeptide; (d) encodes a kinase polypeptide having an amino acid 
sequence selected from the group consisting of those set forth in SEQ ID NO: 122, SEQ ID 
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NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID 
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID 
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID N0:21 1, SEQ ID NO:212, SEQ ID 
NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID 
5 N0:2 1 8, SEQ ID N0:2 1 9, SEQ ID NO:220, SEQ ID NO:22 1 , SEQ ID NO:222, SEQ ID 
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID 
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:23 1, SEQ ID NO:232, SEQ ID 
NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID N0.237, SEQ ID 
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242 5 or the 
1 0 corresponding full-length amino acid sequence, or fragments thereof. A sequence that is 

substantially similar to a sequence selected from the group consisting of those set forth in 
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SEQ ID NO:227, SEQ ID NO:22S, SEQ ID NO:229, SEQ ID NO.230, SEQ ID NO:231, 
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ED NO:235, SEQ ID NO:236, 
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , 
and SEQ ID NO:242 will have at least 75% identity (preferably 90%, more preferably at 
least 95% and most preferably 99-1 00%) to the sequence of SEQ ID NO: 122, SEQ ID 
NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID 
NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID 
NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID 
NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ED NO:142, SEQ ID 
NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID 
NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID 
NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID 
NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID 
NO:163, SEQ ID NO:164, SEQ ID NO:165. SEQ ID NO:166, SEQ ID NO:167, SEQ ID 
NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID 
NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID 
NO: 178, SEQ ID NO: 179, SEQ ID NO: 180, SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID 
NO:183, SEQ ID NO:l 84, SEQ ID NO:185, SEQ ID NO: 186, SEQ ID NO: 187, SEQ ID 
NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID 
NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ED NO: 197, SEQ ED 
NO: 198, SEQ ED NO:l 99, SEQ ED NO:200, SEQ ID NO:201, SEQ ED NO:202, SEQ ID 
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ED NO:207, SEQ ID 
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID 
NO:213, SEQ ID NO:214, SEQ ED NO:215, SEQ ED NO:216, SEQ ID NO:217, SEQ ID 
NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID 
NO:223, SEQ ED NO:224, SEQ ED NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID 
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID N0.232, SEQ ID 
NO:233, SEQ ID NO:234, SEQ ED NO:235, SEQ ID NO:236, SEQ ED NO:237, SEQ ID 
NO:238, SEQ ID NO:239, SEQ ED NO:240, SEQ ID NO:241, and SEQ ID NO:242, 
except that it lacks one or more of the domains selected from the group consisting of a N- 
terminal domain, a catalytic domain, a C-terminal domain, a coiled-coil structure region, a 
proline-rich region, a spacer region, an insert, and a C-terminal tail; or (i) is the 



WO 00/73469 



PCTAJS00/14842 



complement of the nucleotide sequence of (h). The domain demarcations of the 
polypeptides of the invention are indicated in Table 2 by reference to the kinase domain. 

The term "complement" refers to two nucleotides that can form multiple favorable 
interactions with one another. For example, adenine is complementary to thymine as they 

5 can form two hydrogen bonds. Similarly, guanine and cytosine are complementary since 

they can form three hydrogen bonds. A nucleotide sequence is the complement of another 
nucleotide sequence if all of the nucleotides of the first sequence are complementary to all 
of the nucleotides of the second sequence. 

The term "domain" refers to a region of a polypeptide that contains a particular 

1 0 function. For instance, N-terminal or C-terminal domains of signal transduction proteins 

can serve functions including, but not limited to, binding molecules that localize the signal 
transduction molecule to different regions of the cell or binding other signaling molecules 
directly responsible for propagating a particular cellular signal. Some domains can be 
expressed separately from the rest of the protein and function by themselves, while others 

1 5 must remain part of the intact protein to retain function. The latter are termed functional 

regions of proteins and also relate to domains. 

The term "N-terminal domain" refers to the extracatalytic region located between 
the initiator methionine and the catalytic domain of the protein kinase. The N-terminal 
domain can be identified following a Smith-Waterman alignment of the protein sequence 

20 against the non-redundant protein database to define the N-terminal boundary of the 

catalytic domain. Depending on its length, the N-terminal domain may or may not play a 
regulatory role in kinase function. An example of a protein kinase whose N-terminal 
domain has been shown to play a regulatory role is PAK65, which contains a CRIB motif 
used for Cdc42 and rac binding (Burbelo, PX>. et al (1995) J. Biol. Chem. 270, 29071- 

25 29074). The N-terminal domain of a protein kinase of the invention is that portion of the 

protein kinase to the amino-terminal side of the kinase domain where the kinase domain is 
identified in Table 2, herein. Further, in some cases, portions of the N-terminal domains 
of the protein kinases of the invention have not been identified since the entire sequence is 
not available. However, with the methods described herein, the full-length sequences of 

30 the kinases of the invention can be determined and using the approaches described herein 
the N-terminal domain can be identified. 
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The term "catalytic domain" or "kinase domain" refers to a region of the protein 
kinase that is typically 25-300 amino acids long and is responsible for carrying out the 
phosphate transfer reaction from a high-energy phosphate donor molecule such as ATP or 
GTP to itself (autophosphorylation) or to other proteins (exogenous phosphorylation). 

5 The catalytic domain of protein kinases is made up of 12 subdomains that contain highly 

conserved amino acid residues, and are responsible for proper polypeptide folding and for 
catalysis. The catalytic domain can be identified following a Smith-Waterman alignment 
of the protein sequence against the non-redundant protein database. The catalytic/kinase 
domains of the protein kinases of the invention are identified in Table 2, herein. Further, 

1 0 in some cases, the complete sequence of the catalytic/kinase domains of the protein 

kinases of the invention may not have been provided since the entire sequence is not 
available. However, with the methods described herein, the full-length sequences of the 
kinases of the invention can be determined, and using the approaches described herein, the 
catalytic/kinase domain can be identified. 

1 5 The term "catalytic activity", as used herein, defines the rate at which a kinase 

catalytic domain phosphorylates a substrate. Catalytic activity can be measured, for 
example, by determining the amount of a substrate converted to a phosphorylated product 
as a function of time. Catalytic activity can be measured by methods of the invention by 
holding time constant and determining the concentration of a phosphorylated substrate 

20 after a fixed period of time. Phosphorylation of a substrate occurs at the active-site of a 

protein kinase. The active-site is normally a cavity in which the substrate binds to the 
protein kinase and is phosphorylated. 

The term "substrate" as used herein refers to a molecule phosphorylated by a 
kinase of the invention. Kinases phosphorylate substrates on serine/threonine or tyrosine 

25 amino acids. The molecule may be another protein or a polypeptide. 

The term "C-terminal domain" refers to the region located between the catalytic 
domain and the carboxy-terminal ammo acid residue of the protein kinase. The C- 
terminal domain can be identified by using a Smith-Waterman alignment of the protein 
sequence against the non-redundant protein database to define the C-terminal boundary of 

30 the catalytic domain or of any functional C-tenninal extracatalytic domain. Depending on 
its length and amino acid composition, the C-terminal domain may or may not play a 
regulatory role in kinase function. An example of a protein kinase whose C-terminal 
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domain may play a regulatory role is PAK3 which contains a heterotrimeric G b subunit- 
binding site near its C-terminus (Leeuw, T. et al (1998) Nature, 391, 191-195). The C- 
terminal domain of a protein kinase of the invention is that portion of the protein kinase to 
the carboxy-terminal side of the kinase domain where the kinase domain is identified in 
5 Table 2, herein. In some cases, the C-terminal domains of the protein kinases of the 

invention have not been provided since the entire sequence is not available. However, 
with the methods described herein, the full-length sequences of the kinases of the 
invention can be determined, and using the approaches described herein, the C-terminal 
domain can be identified. 

10 The term "signal transduction pathway*' refers to the molecules that propagate an 

extracellular signal through the cell membrane to become an intracellular signal. This 
signal can then stimulate a cellular response. The polypeptide molecules involved in 
signal transduction processes are typically receptor and non-receptor protein tyrosine 
kinases, receptor and non-receptor protein phosphatases, SRC homology 2 and 3 domains, 

15 phosphotyrosine binding proteins (SRC homology 2 (SH2) and phosphotyrosine binding 

(PTB and PH) domain containing proteins), proline-rich binding proteins (SH3 domain 
containing proteins), nucleotide exchange factors, and transcription factors. 

The term "coiled-coil structure region" as used herein, refers to a polypeptide 
sequence that has a high probability of adopting a coiled-coil structure as predicted by 

20 computer algorithms such as COILS (Lupas, A. (1996) Meth. Enzymology 266:513-525). 

Coiled-coils are formed by two or three amphipathic a-helices in parallel. Coiled-coils can 
bind to coiled-coil domains of other polypeptides resulting in homo- or heterodimers 
(Lupas, A. (1991) Science 252:1 162-1 164). Coiled-coil-dependent oligomerization has 
been shown to be necessary for protein function including catalytic activity of 

25 serine/threonine kinases (Roe, J. et al (1997) J. Biol. Chem. 272:5838-5845). Coiled-coil 

regions in the proteins of the invention can be identified using these methods. They may 
be present as sub-domains of the N-terminal, kinase, or C-terminal domains of the 
polypeptides of the invention. 

The term "proline-rich region" as used herein, refers to a region of a protein kinase 

30 whose proline content over a given amino acid length is higher than the average content of 
this amino acid found in proteins (i.e., >10%). Proline-rich regions are easily discemable 
by visual inspection of amino acid sequences and quantitated by standard computer 
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sequence analysis programs such as the DNAStar program EditSeq. Proline-rich regions 
have been demonstrated to participate in regulatory protein -protein interactions. Among 
these interactions, those that are most relevant to this invention involve the "PxxP" proline 
rich motif found in certain protein kinases {i.e., human PAK1) and the SH3 domain of the 
adaptor molecule Nek (Galisteo, M.L. et al (1996) J. Biol. Chem. 271 :20997-21000). 
Other regulatory interactions involving "PxxP" proline-rich motifs include the WW 
domain (Sudol, M. (1996) Prog. Biophys. Mol. Bio. 65:1 13-132). Proline rich regions in 
the proteins of the invention can be identified using these methods. They may be present 
as subrdomains of the N-terminal, kinase, or C-terminal domains of the polypeptides of 
the invention. 

The term "spacer region" as used herein, refers to a region of the protein kinase 
located between predicted functional domains. The spacer region has no detectable 
homology to any amino acid sequence in the database, and can be identified by using a 
Smith-Waterman alignment of the protein sequence against the non-redundant protein 
database to define the C- and N-terminal boundaries of the flanking functional domains. 
Spacer regions may or may not play a fundamental role in protein kinase function. 
Precedence for the regulatory role of spacer regions in kinase function is provided by the 
role of the sre kinase spacer in inter-domain interactions (Xu, W. et al (1997) Nature 
385:595-602). Spacer regions in the proteins of the invention can be identified using these 
methods. They may be present as sub-domains of the N-terminal, kinase, or C-terminal 
domains of the polypeptides of the invention. 

The term "insert" as used herein refers to a portion of a protein kinase that is 
absent from a close homolog. Inserts may or may not by the product alternative splicing 
of exons. Inserts can be identified by using a Smith-Waterman sequence alignment of the 
protein sequence against the non-redundant protein database, or by means of a multiple 
sequence alignment of homologous sequences using the DNAStar program Megalign. 
Inserts may play a functional role by presenting a new interface for protein-protein 
interactions, or by interfering with such interactions. Insert regions in the proteins of the 
invention can be identified using these methods. They may be present as sub-domains of 
the N-tenminal, kinase, or C-terminal domains of the polypeptides of the invention. 
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The term "C-terminal tail" as used herein, refers to a C-terminal domain of a 
protein kinase, that by homology extends or protrudes past the C-terminal amino acid of 
its closest homolog. C-terminal tails can be identified by using a Smith-Waterman 
sequence alignment of the protein sequence against the non-redundant protein database, or 

5 by means of a multiple sequence alignment of homologous sequences using the DNAStar 

program Megalign. Depending on its length, a C-terminal tail may or may not play a 
regulatory role in kinase function. C-terminal tail regions in the proteins of the invention 
can be identified using these methods. They may be present as sub-domains of the N- 
terminal, kinase, or C-terminal domains of the polypeptides of the invention. 

] o Various low or high stringency hybridization conditions may be used depending 

upon the specificity and selectivity desired. These conditions are well-known to those 
skilled in the art. Under stringent hybridization conditions only highly complementary 
nucleic acid sequences hybridize. Preferably, such conditions prevent hybridization of 
nucleic acids having more than 1 or 2 mismatches out of 20 contiguous nucleotides, more 

15 preferably, such conditions prevent hybridization of nucleic acids having more than 1 or 2 

mismatches out of 50 contiguous nucleotides, most preferably, such conditions prevent 
hybridization of nucleic acids having more than 1 or 2 mismatches out of 1 00 contiguous 
nucleotides. In some instances, the conditions may prevent hybridization of nucleic acids 
having more than 5 mismatches in the full-length sequence. 

20 By stringent hybridization assay conditions is meant hybridization assay conditions 

at least as stringent as the following: hybridization in 50% formamide, 5X SSC, 50 mM 
NaH 2 PO<, pH 6.8, 0.5% SDS, 0.1 mg/mL sonicated salmon spemi DNA, and 5X Denhart 
solution at 42 °C overnight; washing with 2X SSC, 0.1% SDS at 45 °C; and washing with 
0.2X SSC, 0.1% SDS at 45 °C. Under some of the most stringent hybridization assay 

25 conditions, the second wash can be done with 0.1X SSC at a temperature up to 70 °C (pg. 

421, Berger et al (1987) Guide to Molecular Cloning Techniques , Meth. Enzym. vol. 152, 
hereby incorporated by reference herein including any figures, tables, or drawings.). 
However, other applications may require the use of conditions falling between these sets 
of conditions. Methods of determining the conditions required to achieve desired 

30 hybridizations are well-known to those with ordinary skill in the art, and are based on 

several factors, including but not limited to, the sequences to be hybridized and the 
samples to be tested. 
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In other preferred embodiments, the invention features isolated, enriched, or 
purified nucleic acid molecules encoding kinase polypeptides, further comprising a vector 
or promoter effective to initiate transcription in a host cell. The invention also features 
recombinant nucleic acid, preferably in a cell or an organism. The recombinant nucleic 
acid may contain a sequence selected from the group consisting of those set forth in SEQ 
ID NO: 1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, 
SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:l 0, SEQ ID NO:l 1 , SEQ ID 
NO: 12, SEQ ID NO: 13, SEQ ED NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, 
SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ 
ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID 
NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, 
SEQ ID NO:34, SEQ ID NO:35, SEQ DO NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ 
ID NO:39, SEQ ID NO:40, SEQ ID NO:41 , SEQ ID NO:42, SEQ ID NO:43, SEQ ID 
NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, 
SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ 
ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID 
NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ ED NO:63, SEQ ID NO:64, SEQ ID NO:65, 
SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ 
ID NO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID 
NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:81 , 
SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ 
ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ED NO:91, SEQ ED 
NO:92, SEQ ID NO:93, SEQ ED NO:94, SEQ ED NO:95, SEQ ED NO:96, SEQ ED NO:97, 
SEQ ED NO:98, SEQ ED NO:99, SEQ ED NO: 100, SEQ ED NO: 101, SEQ ED NO: 102, 
SEQ ED NO: 1 03, SEQ ED NO: 1 04, SEQ ED NO: 1 05, SEQ ID NO: 1 06, SEQ ED NO: 1 07, 
SEQ ED NO: 1 08, SEQ ED NO: 1 09, SEQ ED NO: 1 1 0, SEQ ED NO: 1 1 1 , SEQ ID NO: 1 1 2, 
SEQ ED NO:l 13, SEQ ID NO:l 14, SEQ ID NO:l 15, SEQ ID NO:l 16, SEQ ID NO:l 17, 
SEQ ED NO: 1 1 8, SEQ ID NO: 1 1 9, SEQ ED NO: 1 20, and SEQ ID NO: 1 2 1 , or a functional 
derivative thereof and a vector or a promoter effective to initiate transcription in a host 
cell. The recombinant nucleic acid can alternatively contain a transcriptional initiation 
region functional in a cell, a sequence complementary to an RNA sequence encoding a 
kinase polypeptide and a transcriptional termination region functional in a cell. Specific 
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vectors and host cell combinations are discussed herein. The recombinant nucleic acid can 
also contain the full-length sequence encoding the protein kinase, or a domain, for 
example. 

The term "vector" relates to a single or double-stranded circular nucleic acid 
molecule that can be transfected into cells and replicated within or independently of a cell 
genome. A circular double-stranded nucleic acid molecule can be cut and thereby 
linearized upon treatment with restriction enzymes. An assortment of nucleic acid vectors, 
restriction enzymes, and the knowledge of the nucleotide sequences cut by restriction 
enzymes are readily available to those skilled in the art. A nucleic acid molecule encoding 
a kinase can be inserted into a vector by cutting the vector with restriction enzymes and 
ligating the two pieces together. 

The term "transfecting" defines a number of methods to insert a nucleic acid vector 
or other nucleic acid molecules into a cellular organism. These methods involve a variety 
of techniques, such as treating the cells with high concentrations of salt, an electric field, 
detergent, or DMSO to render the outer membrane or wall of the cells permeable to 
nucleic acid molecules of interest or use of various, viral transduction strategies. 

The term "promoter" as used herein, refers to nucleic acid sequence needed for 
gene sequence expression. Promoter regions vary from organism to organism, but are well 
known to persons skilled in the art for different organisms. For example, in prokaryotes, 
the promoter region contains both the promoter (which directs the initiation of RNA 
transcription) as well as the DNA sequences which, when transcribed into RNA, will 
signal synthesis initiation. Such regions will normally include those 5 '-non-coding 
sequences involved with initiation of transcription and translation, such as the TATA box, 
capping sequence, CAAT sequence, and the like. 

In preferred embodiments, the isolated nucleic acid comprises, consists essentially 
of, or consists of a nucleic acid sequence set forth in SEQ ID NO:l, SEQ ID NO:2, SEQ 
ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ED NO:6, SEQ ID NO:7, SEQ ID NO:8, 
SEQ ID NO:9, SEQ ID NO: 10, SEQ ED NO:l 1, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID 
NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ED NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, 
SEQ ID NO:20, SEQ DD NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ 
ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID 
NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, 
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SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ED NO:39, SEQ ID NO:40, SEQ 
ID N0:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID 
NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID N0:5 1 , 
SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ 

5 ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID N0:61 , SEQ ID 

NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, 
SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID N0:71, SEQ ID NO:72, SEQ 
ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ DD NO:76, SEQ ID NO:77, SEQ ID 
NO:78, SEQ ID NO:79, SEQ ID NO:80, SEQ ID N0:81, SEQ ID NO:82, SEQ ID NO:83, 

10 SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ 

ID NO:89, SEQ ID NO:90, SEQ ID N0:91 , SEQ ID NO:92, SEQ ID NO:93, SEQ ID 
NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, 
SEQ ID NO: 1 00, SEQ ID NO: 1 01 , SEQ ID NO: 1 02, SEQ ID NO: 1 03, SEQ ID NO: 1 04, 
SEQ ID NO: 1 05, SEQ ID NO: 1 06, SEQ ID NO: 1 07, SEQ ID NO: 1 08, SEQ ID NO: 1 09, 

15 SEQ ID NO: 1 1 0, SEQ ID NO: 1 1 1 , SEQ ID NO: 1 1 2, SEQ ID NO: 1 1 3, SEQ ID NO: 1 1 4, 

SEQ DDNO:l 15, SEQ ED NO:l 16, SEQ ID NO:l 17, SEQ ID NO:l 18, SEQ ID NO: 1 19, 
SEQ ID NO: 120, and SEQ ID NO:121, or the corresponding full-length sequence, encodes 
an amino acid sequence selected from the group consisting of those set forth in SEQ ID 
NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID 

20 NO: 127, SEQ DD NO: 128, SEQ DD NO: 129, SEQ DD NO: 130, SEQ DD NO: 131, SEQ ID 
NO: 1 32, SEQ ID NO: 1 33, SEQ ID NO: 1 34, SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ ID 
NO: 137, SEQ ID NO: 138, SEQ DD NO: 139, SEQ ED NO: 140, SEQ ED NO: 141, SEQ ID 
NO:142, SEQ DD NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID 
NO: 1 47, SEQ DD NO: 1 48, SEQ DD NO: 1 49, SEQ DD NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID 

25 NO: 152, SEQ ID NO: 153, SEQ DD NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ DD 
NO: 1 57, SEQ ED NO: 1 58, SEQ DD NO: 1 59, SEQ DD NO: 1 60, SEQ DD NO: 1 61 , SEQ ID 
NO:162, SEQDDNO:163, SEQDDNO:164, SEQDDNO:165. SEQDDNO:166, SEQ ID 
NO: 167, SEQ DD NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID NO:171, SEQ ID 
NO: 1 72, SEQ ID NO: 1 73, SEQ DD NO: 1 74, SEQ DD NO: 1 75, SEQ ID NO: 1 76, SEQ ID 

30 NO: 1 77, SEQ DD NO: 1 78, SEQ DD NO: 1 79, SEQ DD NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID 
NO: 1 82, SEQ DD NO: 1 83, SEQ DD NO: 1 84, SEQ DD NO: 1 85, SEQ DD NO: 1 86, SEQ ID 
NO: 1 87, SEQ DD NO: 1 88, SEQ DD NO: 1 89, SEQ ED NO: 1 90, SEQ DD NO: 1 91 , SEQ DD 
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NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID 
NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID 
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID 
5 NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID 

NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID 
NO:232, SEQ ED NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 

10 NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ED NO:241, and SEQ 
ID NO:242, or the corresponding full-length amino acid sequence, a functional derivative 
thereof, or at least 10, 20, 40, 50, 75, 1 00, 200, 300 or 500 contiguous amino acids of a 
sequence selected from the group consisting of those set forth in SEQ ID NO: 122, SEQ ID 
NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID 

15 NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO:131, SEQ ID NO: 132, SEQ ID 

NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID 
NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO: 141, SEQ ID NO:142, SEQ ID 
NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID 
NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID 

20 NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID N0.156, SEQ ID NO:157, SEQ ID 
NO: 158, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 6 1 , SEQ ID NO: 1 62, SEQ ID 
NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65 . SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID 
NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ID 
NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID 

25 NO: 1 78, SEQ ED NO: 1 79, SEQ ED NO: 180, SEQ ID NO: 1 8 1 , SEQ ED NO: 1 82, SEQ ED 
NO:183, SEQ EDN0.184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID 
NO: 188, SEQ ED NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID 
NO: 1 93, SEQ ED NO: 1 94, SEQ ID NO: 1 95 , SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID 
NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID 

30 NO:203, SEQ ED NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID 
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID 
NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID 
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NO:218, SEQ ID NO:219, SEQ ED NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID 
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID 
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID 
NO:233, SEQ ID NO:234, SEQ ID NO:235 5 SEQ ID NO:236, SEQ ID NO:237, SEQ ID 

5 NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ ID NO:242 5 or the 
corresponding full-length sequences or derivatives thereof. The nucleic acid may be 
isolated from a natural source by cDNA cloning or by subtractive hybridization. The 
natural source may be mammalian, preferably human, blood, semen, or tissue, and the 
nucleic acid may be synthesized by the triester method or by using an automated DNA 

1 0 synthesizer. 

The term "mammal" refers preferably to such organisms as mice, rats, rabbits, 
guinea pigs, sheep, and goats, more preferably to cats, dogs, monkeys, and apes, and most 
preferably to humans. 

In yet other preferred embodiments, the nucleic acid is a conserved or unique 

1 5 region, for example those useful for: the design of hybridization probes to facilitate 

identification and cloning of additional polypeptides, the design of PCR probes to 
facilitate cloning of additional polypeptides, obtaining antibodies to polypeptide regions, 
and designing antisense oligonucleotides. 

By "conserved nucleic acid regions", are meant regions present on two or more 

20 nucleic acids encoding a kinase polypeptide, to which a particular nucleic acid sequence 

can hybridize under lower stringency conditions. Examples of lower stringency 
conditions suitable for screening for nucleic acid encoding kinase polypeptides are 
provided in Berger et al. (1987) Guide to Molecular Cloning Techniques , Meth. Enzym. 
vol. 152, hereby incorporated by reference herein in its entirety, including any drawings, 

25 figures, or tables. Preferably, conserved regions differ by no more than 5 out of 20 

nucleotides, even more preferably 2 out of 20 nucleotides or most preferably 1 out of 20 
nucleotides. 

By <l unique nucleic acid region" is meant a sequence present in a nucleic acid 
coding for a kinase polypeptide that is not present in a sequence coding for any other 
30 naturally occurring polypeptide. Such regions preferably encode 10 (preferably 25, more 
preferably 50, most preferably 75) or more contiguous amino acids selected from the 
group consisting of those set forth in SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, 
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SEQ ED NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, 
SEQ ID NO: 1 30, SEQ ID NO: 1 31 , SEQ ID NO: 1 32, SEQ ID NO: 1 33, SEQ ID NO: 1 34, 
SEQ ID NO: 1 35, SEQ ED NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: 138, SEQ ID NO: 1 39, 
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, 

5 SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, 
SEQ ID NO: 150, SEQ ID NO:151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, 
SEQ ID NO: 155, SEQ ID NO:156, SEQ ID N0.157, SEQ ID NO: 158, SEQ ED NO: 159, 
SEQ ED NO: 160, SEQ ED NO: 161, SEQ ED NO: 162, SEQ ED NO: 163, SEQ ED NO: 164, 
SEQ ID NO: 165. SEQ ED NO: 166, SEQ ED NO: 167, SEQ ED NO: 168, SEQ ID NO: 169, 

10 SEQ ID NO:170, SEQ ED NO:171, SEQ ED NO:172, SEQ ID NO:173, SEQ ID NO:174, 
SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 178, SEQ ID NO: 1 79, 
SEQ ID NO:l 80, SEQ ID NO:l 81 , SEQ ID NO:l 82, SEQ ID NO:l 83, SEQ ID NO:l 84, 
SEQ ED NO: 185, SEQ ID NO:186, SEQ ID NO: 187, SEQ ID NO:188, SEQ ID NO: 189, 
SEQ ID NO: 190, SEQ ID NO: 191, SEQ ED NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, 

15 SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, 
SEQ ID NO:200, SEQ ID NO:201 , SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, 
SEQ ID NO:205, SEQ ID NO:206, SEQ ED NO:207, SEQ ED NO:208, SEQ ID NO:209, 
SEQ ID NO:2 1 0, SEQ ID NO:2 1 1 , SEQ ED NO:2 1 2, SEQ ED NO:2 1 3, SEQ ID NO:2 14, 
SEQ ED NO:215, SEQ ID NO:216, SEQ ED NO:217, SEQ ID NO:218, SEQ ID NO:219, 

20 SEQ ID NO:220, SEQ ED NO:221 , SEQ ED NO:222, SEQ ED NO:223, SEQ ED NO:224, 

SEQ ID NO:225, SEQ ED NO:226, SEQ ED NO:227, SEQ ED NO:228, SEQ ED NO:229, 
SEQ ID NO:230, SEQ ED NO:231, SEQ ED NO:232, SEQ ID NO:233, SEQ ED NO:234, 
SEQ ED NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, 
SEQ ID NO:240, SEQ ED NO:241 , and SEQ ED NO:242, or functional derivatives thereof. 

25 In particular, a unique nucleic acid region is preferably of mammalian origin and 

preferably human. 

A second aspect of the invention features a nucleic acid probe for the detection of 
nucleic acid encoding a kinase polypeptide in a sample, wherein said polypeptide is 
selected from the group consisting of SEQ ED NO:122, SEQ ED NO:123, SEQ ID NO:124, 
30 SEQ ID NO: 125, SEQ ID NO: 126, SEQ ED NO: 127, SEQ ED NO: 128, SEQ ID NO: 129, 
SEQ ID NO.130, SEQ ED NO:131, SEQ ED NO: 132, SEQ ED NO:133, SEQ ED NO: 134, 
SEQ ID NO: 135, SEQ ED NO: 136, SEQ ED NO: 137, SEQ ED NO:138, SEQ ID NO: 139, 
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SEQ ID NO:140, SEQ ID N0:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, 
SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, 
SEQ ID NO:150, SEQ IDN0:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, 
SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, 
SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, 
SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, 
SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, 
SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, 
SEQ ID NO: 180, SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID NO:l 83, SEQ ID NO: 184, 
SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID NO: 187, SEQ ID NO:188, SEQ ID NO: 189, 
SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, 
SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, 
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, 
SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, 
SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, 
SEQ ID N0.215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, 
SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, 
SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, 
SEQ ID NO:230, SEQ ID NO:23 1 , SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, 
SEQ ID NO:235, SEQ ID NO:236, SEQ ID N0.237, SEQ ID N0.238, SEQ ID NO:239, 
SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242. Preferably, the nucleic acid 
probe encodes a kinase polypeptide that is a fragment of the protein encoded by an amino 
acid sequence selected from the group consisting of those set forth in SEQ ID NO: 122, 
SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ED NO:126, SEQ ID NO:127, 
SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO:132, 
SEQ ED NO:133, SEQ ED NO:l 34, SEQ ID NO:135, SEQ ED NO:l 36, SEQ ED NO:137, 
SEQ ED NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ED NO:141, SEQ ID NO:142, 
SEQ ID NO:143, SEQ ID NO:144, SEQ ED NO:145, SEQ ED NO:146, SEQ ID NO:147, 
SEQ ID NO: 148, SEQ ED NO: 149, SEQ ID NO: 150, SEQ ID NO:151, SEQ ID NO: 152, 
SEQ ED NO:153, SEQ ID NO: 154, SEQ ID NO:155, SEQ ID NO:l 56, SEQ ID NO:157, 
SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, 
SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65. SEQ ID NO: 1 66, SEQ ID NO: 1 67, 
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SEQ ID NO: 168, SEQ ED NO:169, SEQ tD NO: 170, SEQ ED NO:171, SEQ ID NO: 172, 
SEQ ED NO: 173, SEQ ED NO: 174, SEQ ED NO: 175, SEQ ED NO: 176, SEQ ID NO: 177, 
SEQ ID NO: 1 78, SEQ ED NO: 1 79, SEQ ED NO: 1 80, SEQ ED NO: 1 8 1 , SEQ ED NO: 1 82, 
SEQ ED NO: 1 83, SEQ ED NO: 1 84, SEQ ED NO: 1 85, SEQ ED NO: 1 86, SEQ ED NO: 1 87, 

5 SEQ ED NO: 1 88, SEQ ED NO: 1 89, SEQ ID NO: 1 90, SEQ ED NO: 1 9 1 , SEQ ID NO: 1 99, 
SEQ ED NO:193, SEQ ED NO:194, SEQ ED NO:195, SEQ ED NO:196, SEQ ED NO:197, 
SEQ ED NO: 198, SEQ ED NO: 199, SEQ ID NO:200, SEQ ED NO:201, SEQ ID NO.202, 
SEQ ID NO:203, SEQ ED NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, 
SEQ ED_NO:208,.SEQ ID NO:209, SEQ ID NO:21 0, SEQ ID NO:21 1 , SEQ ID NO:21 2, 

10 SEQ ED NO:213, SEQ ID NO:214, SEQ ED NO:215, SEQ ID NO:216, SEQ ID N0.217, 

SEQ ED NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, 
SEQ ED NO:223, SEQ ED NO:224, SEQ ED NO:225, SEQ ID NO:226, SEQ ID NO:227, 
SEQ ED NO:228, SEQ ED NO:229, SEQ ED NO:230, SEQ ID NO:231, SEQ ID NO:232, 
SEQ ED NO:233, SEQ ED NO:234, SEQ ED NO:235, SEQ ID NO:236, SEQ ID NO:237, 

1 5 SEQ ID NO:238, SEQ ED NO:239, SEQ ED NO:240, SEQ ED NO:241 , and SEQ ED 

NO:242, or the corresponding full-length amino acid sequences. The nucleic acid probe 
contains a nucleotide base sequence that will hybridize to a sequence selected from the 
group consisting of those set forth in SEQ ED NO:l, SEQ ED NO:2, SEQ ED NO:3, SEQ 
ID NO:4, SEQ ID NO:5, SEQ ED NO:6, SEQ ED NO:7, SEQ ED NO:8, SEQ ID NO:9, 

20 SEQ ID NO: 1 0, SEQ ID NO: 1 1 , SEQ ED NO: 1 2, SEQ ID NO: 1 3, SEQ ID NO: 1 4, SEQ 
ID NO: 1 5, SEQ ID NO: 1 6, SEQ ED NO: 1 7, SEQ ID NO: 1 8, SEQ ED NO: 1 9, SEQ ID 
NO:20, SEQ ED NO:21 , SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, 
SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO.30, SEQ 
ID NO:3 1 , SEQ ID NO:32, SEQ ID NO:33, SEQ ED NO:34, SEQ ID NO:35, SEQ ID 

25 NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:4 1 , 
SEQ ED NO:42, SEQ ED NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ 
ID NO:47, SEQ ED NO:48, SEQ ED NO:49, SEQ ED NO:50, SEQ ED NO:51 , SEQ ID 
NO:52, SEQ ED NO:53, SEQ ED NO:54, SEQ ED NO:55, SEQ ED NO:56, SEQ ED NO:57, 
SEQ ED NO:58, SEQ ED NO:59, SEQ ED NO:60, SEQ ED NO:61, SEQ ED NO:62, SEQ 

30 ED NO:63, SEQ ED NO:64, SEQ ED NO:65, SEQ ED NO:66, SEQ ED NO:67, SEQ ED 

NO:68, SEQ ED NO:69, SEQ ED NO:70, SEQ ED NO:71, SEQ ED NO:72, SEQ ED NO:73, 
SEQ ED NO:74, SEQ ED NO:75, SEQ ED NO:76, SEQ ID NO:77, SEQ ED NO:78, SEQ 
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ID NO:79, SEQ ID NO:80, SEQ ID NO:.81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID 
NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, 
SEQ ID NO:90, SEQ ID N0:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ 
ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID 
NO: 100, SEQ ID NO: 101, SEQ ID NO: 102, SEQ ID NO: 103, SEQ ID NO: 104, SEQ ID 
NO:105, SEQ ID NO: 106, SEQ ID NO: 107, SEQ ID NO: 108, SEQ ID NO: 109, SEQ ID 
NO:l 1 0, SEQ ID NO: 1 1 1 , SEQ ID NO: 1 1 2, SEQ ID NO: 1 1 3, SEQ ID NO: 1 1 4, SEQ ID 
NO: 1 1 5, SEQ ID NO:l 1 6, SEQ ID NO: 1 1 7, SEQ ID NO: 1 1 8, SEQ ID NO: 1 1 9, SEQ ID 
NO:120, and SEQ ID NO:121, or the corresponding full-length sequence, or a functional 
derivative thereof. 

In preferred emhodiments, the nucleic acid probe hybridizes to nucleic acid 
encoding at least 6, 12, 75, 90, 105, 120, 150, 200, 250, 300 or 350 contiguous amino 
acids of a sequence selected from the group consisting of those set forth in SEQ ID 
NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID 
NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID 
NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO: 136, SEQ ID 
NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID 
NO:142, SEQ ID NO:143, SEQ ED NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID 
NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID 
NO:l 52, SEQ ID NO:l 53, SEQ ID NO:l 54, SEQ ID NO:l 55, SEQ ID NO: 1 56, SEQ ID 
NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO:l 59, SEQ ID NO: 1 60, SEQ ID NO: 1 61 , SEQ ID 
NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65 . SEQ ID NO: 1 66, SEQ ID 
NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID 
NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID 
NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID 
NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO:l 84, SEQ ID NO: 1 85 , SEQ ID NO: 1 86, SEQ ID 
NO:l 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ ID 
NO: 1 99, SEQ ID NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ ID 
NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ED NO:200, SEQ ID NO:201, SEQ ID 
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 
NO:207, SEQ ED NO:208, SEQ ID NO:209, SEQ ID NO:21 0, SEQ ID NO:21 1 , SEQ ID 
NO:2 1 2, SEQ ED NO:2 1 3, SEQ ID NO:2 1 4, SEQ ID NO:2 1 5, SEQ ID NO:2 1 6, SEQ ID 
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NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ED NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ED NO:228, SEQ ED NO:229, SEQ ID NO:230, SEQ ED NO:231, SEQ ID 
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 
5 NO:237, SEQ DDNO:238, SEQ IDNO:239, SEQ ED NO:240, SEQ ID NO:241, and SEQ 
ID NO:242, or the corresponding full-length amino acid sequence, or functional 
derivatives thereof. 

Methods for using the probes include detecting the presence or amount of kinase 
RNA in a sample by contacting the sample with a nucleic acid probe under conditions 

10 such that hybridization occurs and detecting the presence or amount of the probe bound to 
kinase RNA. The nucleic acid duplex formed between the probe and a nucleic acid 
sequence coding for a kinase polypeptide may be used in the identification of the sequence 
of the nucleic acid detected (Nelson et al, in Nonisotopic DNA Probe Techniques, 
Academic Press, San Diego, Kricka, ed., p. 275, 1992, hereby incorporated by reference 

1 5 herein in its entirety, including any drawings, figures, or tables). Kits for performing such 

methods may be constructed to include a container means having disposed therein a 



nucleic acid probe. 

In a third aspect, the invention describes a recombinant cell or tissue comprising a 
nucleic acid molecule encoding a kinase polypeptide selected from the group consisting of 
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SEQ ID NO: 187, SEQ ID NO:188, SEQ ID NO: 189, SEQ ID NO:190, SEQ ID NO: 191, 
SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO:195, SEQ ID NO: 196, 
SEQ ID N0:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ED NO:200, SEQ ED NO:201, 
SEQ ED NO:202, SEQ ED NO:203, SEQ ED NO:204, SEQ ED NO:205, SEQ ED NO:206, 

5 SEQ ED NO:207, SEQ ED NO:208, SEQ ED NO:209, SEQ ED NO:2 1 0, SEQ ED NO:2 1 1 , 
SEQ ED NO:212, SEQ ED NO:213, SEQ ED NO:214, SEQ ED NO:215, SEQ ED NO:216, 
SEQ ED NO:217, SEQ ED NO:218, SEQ ED NO:219, SEQ ED NO:220, SEQ ID NO:221, 
SEQ ED NO:222, SEQ ED NO:223, SEQ ID NO:224, SEQ ID N0.225, SEQ ED NO:226, 
. SEQ ED NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ED NO:231 , 

10 SEQ ED NO:232, SEQ ED NO:233, SEQ ID N0.234, SEQ ID NO:235, SEQ ID NO:236, 

SEQ ED NO:237, SEQ ED NO:238, SEQ ED NO:239, SEQ ID NO:240, SEQ ID NO:241 , 
and SEQ ID NO:242. In such cells, the nucleic acid may be under the control of the 
genomic regulatory elements, or may be under the control of exogenous regulatory 
elements including an exogenous promoter. By "exogenous" it is meant a promoter that is 

1 5 not normally coupled in vivo transcriptionally to the coding sequence for the kinase 

polypeptides. 

The polypeptide is preferably a fragment of the protein encoded by an amino acid 
sequence selected from the group consisting of those set forth in SEQ ED NO: 122, SEQ ID 
NO: 123, SEQ ID NO: 124, SEQ ED NO: 125, SEQ ED NO: 126, SEQ ED NO: 127, SEQ ID 

20 NO: 128, SEQ ID NO: 129, SEQ ED NO: 130, SEQ ED NO: 131, SEQ ID NO: 132, SEQ ID 
NO:133, SEQ ED NO:134, SEQ ED NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID 
NO:138, SEQ IDNO:139, SEQ ED NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID 
NO:143, SEQ EDNO:144, SEQ ED NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID 
NO: 148, SEQ ED NO: 149, SEQ ID NO: 150, SEQ ID NO:151, SEQ ID NO: 152, SEQ ID 

25 NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ED NO:156, SEQ ED NO:157, SEQ ID 
NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO:161, SEQ ID NO: 162, SEQ ID 
NO: 163, SEQ ED NO: 164, SEQ ED NO: 165. SEQ ED NO: 166, SEQ ID NO: 167, SEQ ID 
NO: 1 68, SEQ ED NO: 1 69, SEQ ED NO: 1 70, SEQ ED NO: 1 7 1 , SEQ ED NO: 1 72, SEQ ID 
NO:173, SEQ EDNO:174, SEQ ED NO:175, SEQ ED NO:176, SEQ ED NO:177, SEQ ID 

30 NO: 1 78, SEQ ED NO: 1 79, SEQ ED NO: 1 80, SEQ ED NO: 1 8 1 , SEQ ED NO: 1 82, SEQ ID 
NO: 1 83, SEQ ED NO: 1 84, SEQ ED NO: 1 85, SEQ ED NO: 1 86, SEQ ED NO: 1 87, SEQ ID 
NO: 188, SEQ ED NO: 189, SEQ ED NO. 190, SEQ ID NO: 191, SEQ ED NO: 199, SEQ ED 
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NO: 193, SEQ ID NO: 194, SEQ ED NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID 
NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID 
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID 
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID 
5 NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID 
NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID 
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID 
NO:228, SEQ ED NO:229, SEQ ED NO:230, SEQ ED NO:231, SEQ ID N0.232, SEQ ID 
NO:233, SEQ ED NO:234, SEQ ED NO:235, SEQ ED NO:236, SEQ ED NO:237, SEQ ID 
1 0 NO:238, SEQ ED NO:239, SEQ ED NO:240, SEQ ED NO:241, and SEQ ED NO:242, or the 
corresponding full-length amino acid sequence. By "fragment," is meant an amino acid 
sequence present in a kinase polypeptide. Preferably, such a sequence comprises at least 
10, 20, 40, 50, 75, 100, 200, or 300 contiguous amino acids a sequence selected from the 
group consisting of those set forth in SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, 
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SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ED NO:218, SEQ ID NO:2l9, 
SEQ ED NO:220, SEQ ED NO:221, SEQ ED NO:222, SEQ ED NO:223, SEQ ED NO:224, 
SEQ ED NO:225, SEQ ED NO:226, SEQ ED NO:227, SEQ ED NO:228, SEQ ED NO:229, 
SEQ ED NO:230, SEQ ED NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, 
SEQ ED NO:235, SEQ ED NO:236, SEQ ED NO:237, SEQ ID NO:238, SEQ ID NO:239, 
SEQ ED NO:240, SEQ ED NO:241, and SEQ ID NO:242, or of the corresponding full- 
length amino acid sequence, or a functional derivative thereof. 

In a fourth aspect, the invention features an isolated, enriched, or purified kinase 
polypeptide selected from the group consisting of SEQ ED NO: 122, SEQ ID NO: 123, SEQ 
ID NO:124, SEQ ED NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ 
ID NO: 129, SEQ ID NO: 130, SEQ ED NO: 131, SEQ ED NO: 132, SEQ ID NO: 133, SEQ 
ED NO: 134, SEQ ED NO: 135, SEQ ED NO: 136, SEQ ED NO: 137, SEQ ID NO: 138, SEQ 
ED NO:139, SEQ ED NO: 140, SEQ ED NO: 141, SEQ ED NO: 142, SEQ ED NO: 143, SEQ 
ED NO:144, SEQ ED NO: 145, SEQ ED NO: 146, SEQ ED NO: 147, SEQ ED NO: 148, SEQ 
ED NO: 1 49, SEQ ED NO: 1 50, SEQ ED NO: 1 5 1 , SEQ ED NO: 1 52, SEQ ED NO: 1 53, SEQ 
ID NO: 154, SEQ ED NO: 155, SEQ ED NO: 156, SEQ ED NO: 157, SEQ ED NO: 158, SEQ 
ID NO: 1 59, SEQ ED NO: 1 60, SEQ ED NO: 1 61 , SEQ ED NO: 1 62, SEQ ID NO: 1 63, SEQ 
ED NO: 164, SEQ ED NO: 165. SEQ ED NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ 
ED NO:l 69, SEQ ED NO: 1 70, SEQ ED NO: 1 71 , SEQ ID NO:l 72, SEQ ID NO: 1 73, SEQ 
ID NO: 1 74, SEQ ED NO: 1 75, SEQ ED NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ 
ED NO: 1 79, SEQ ED NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83 , SEQ 
ID NO: 1 84, SEQ ED NO: 1 85, SEQ ED NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ 
ID NO: 189, SEQ ED NO: 190, SEQ ED NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ 
ID NO: 194, SEQ ED NO: 195, SEQ ID NO: 196, SEQ ED NO: 197, SEQ ID NO: 198, SEQ 
ID NO: 199, SEQ ED NO:200, SEQ ED NO:201, SEQ ED NO:202, SEQ ID NO:203, SEQ 
ED NO:204, SEQ ED NO:205, SEQ ID NO:206, SEQ ED NO:207, SEQ ID NO:208, SEQ 
ID NO:209, SEQ ED NO:210, SEQ ED NO:21 1 , SEQ ID NO:212, SEQ ED NO:213, SEQ 
ID NO:214, SEQ ED NO:215, SEQ ED NO:216, SEQ ED NO:217, SEQ ED NO:218, SEQ 
ID NO:219, SEQ ED NO:220, SEQ ED NO:221 , SEQ ED NO:222, SEQ ED NO:223, SEQ 
ED NO:224, SEQ ED NO:225, SEQ ED NO:226, SEQ ED NO:227, SEQ ED NO:228, SEQ 
ED NO:229, SEQ ED NO:230, SEQ ID NO:231 , SEQ ED NO:232, SEQ ID NO:233, SEQ 
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ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ 
ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ ID NO:242. 

By "isolated" in reference to a polypeptide is meant a polymer of amino acids (2 or 
more amino acids) conjugated to each other, including polypeptides that are isolated from 
5 a natural source or that are synthesized. The isolated polypeptides of the present invention 

are unique in the sense that they are not found in a pure or separated state in nature. Use 
of the term "isolated" indicates that a naturally occurring sequence has been removed from 
its normal cellular environment. Thus, the sequence may be in a cell-free solution or 
placed in a different cellular environment. The term does not imply that the sequence is 

1 0 the only amino acid chain present, but that it is essentially free (about 90 - 95% pure at 
least) of non-amino acid material naturally associated with it. 

By the use of the term "enriched" in reference to a polypeptide is meant that the 
specific amino acid sequence constitutes a significantly higher fraction (2 - 5 fold) of the 
total amino acid sequences present in the cells or solution of interest than in normal or 

15 diseased cells or in the cells from which the sequence was taken. This could be caused by 

a person by preferential reduction in the amount of other amino acid sequences present, or 
by a preferential increase in the amount of the specific amino acid sequence of interest, or 
by a combination of the two. However, it should be noted that enriched does not imply 
that there are no other amino acid sequences present, just that the relative amount of the 

20 sequence of interest has been significantly increased. The term significant here is used to 

indicate that the level of increase is useful to the person making such an increase, and 
generally means an increase relative to other amino acid sequences of about at least 2-fold, 
more preferably at least 5- to 10-fold or even more. The term also does not imply that 
there is no amino acid sequence from other sources. The other source of amino acid 

25 sequences may, for example, comprise amino acid sequence encoded by a yeast or 

bacterial genome, or a cloning vector such as pUC19. The term is meant to cover only 
those situations in which man has intervened to increase the proportion of the desired 
amino acid sequence. 

It is also advantageous for some purposes that an amino acid sequence be in 

30 purified form. The term "purified" in reference to a polypeptide does not require absolute 
purity (such as a homogeneous preparation); instead, it represents an indication that the 
sequence is relatively purer than in the natural environment. Compared to the natural level 
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this level should be at least 2-5 fold greater (e.g., in terms of mg/mL). Purification of at 
least one order of magnitude, preferably two or three orders, and more preferably four or 
five orders of magnitude is expressly contemplated. The substance is preferably free of 
contamination at a functionally significant level, for example 90%, 95%, or 99% pure. 
5 In preferred embodiments, the kinase polypeptide is a fragment of the protein 

encoded by an amino acid sequence selected from the group consisting of those set forth in 
SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, 
SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO:131, 
SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, 

10 SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO:141, 
SEQ ID NO: 142, SEQ 3D NO: 143, SEQ ID NO: 144, SEQ ED NO: 145, SEQ ED NO: 146, 
SEQ ED NO: 147, SEQ ED NO: 148, SEQ ED NO: 149, SEQ ED NO: 150, SEQ ID NO:151, 
SEQ ED NO:152, SEQ ED NO:153, SEQ ED NO:154, SEQ ED NO:155, SEQ ED NO:l 56, 
SEQ ED NO:l 57, SEQ ED NO: 1 58, SEQ ED NO:l 59, SEQ ED NO: 160, SEQ ED NO:l 61 , 

1 5 SEQ ED NO: 1 62, SEQ ED NO: 1 63, SEQ ED NO: 1 64, SEQ ID NO: 1 65. SEQ ID NO: 1 66, 

SEQ ED NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 171, 
SEQ ED NO:l 72, SEQ ED NO: 1 73, SEQ ID NO:l 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, 
SEQ ED NO: 1 77, SEQ ED NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , 
SEQ ED NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, 

20 SEQ ED NO: 187, SEQ ED NO: 188, SEQ ED NO: 189, SEQ ED NO: 190, SEQ ID NO: 191, 
SEQ ED NO: 199, SEQ ID NO: 193, SEQ ED NO: 194, SEQ ED NO: 195, SEQ ID NO: 196, 
SEQ ED NO: 197, SEQ ED NO: 198, SEQ ED NO: 199, SEQ ED NO:200, SEQ ID NO:201 , 
SEQ ED NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ED NO:205, SEQ ED NO:206, 
SEQ ED NO:207, SEQ ED NO:208, SEQ ED NO:209, SEQ ED NO:21 0, SEQ ID NO:21 1 , 

25 SEQ ED NO:212, SEQ ED NO:213, SEQ ID NO:214, SEQ ED NO:215, SEQ ED NO:216, 

SEQ ED NO:217, SEQ ED NO:218, SEQ ED NO:219, SEQ ED NO:220, SEQ ED NO:221, 
SEQ ED NO:222, SEQ ED NO:223, SEQ ED NO:224, SEQ ED NO:225, SEQ ED NO:226, 
SEQ ED NO:227, SEQ ED NO:228, SEQ ED NO:229, SEQ ED NO:230, SEQ ED NO:231 , 
SEQ ED NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, 

30 SEQ ED NO:237, SEQ ED NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , 
and SEQ ID NO:242, or the corresponding full-length amino acid sequences. Preferably, 
the kinase polypeptide contains at least 10, 20, 40, 50, 75, 100, 200, or 300 contiguous 
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amino acids a sequence selected from the group consisting of those set forth in SEQ ID 
NO: 122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID 
NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID 
NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ED NO:136, SEQ ID 
NO:137, SEQ ID NO:138, SEQ ID N0.139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID 
NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID 
NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID 
NO:l 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID 
N0.157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID 
NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID 
NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ 1 D 
NO:172, SEQ ID N0.173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID 
NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID 
NO: 1 82, SEQ ED NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85 , SEQ ID NO: 1 86, SEQ ID 
NO: 187, SEQ ED NO:188, SEQ ID NO: 189, SEQ ED NO: 190, SEQ ED NO: 191, SEQ ID 
NO: 1 99, SEQ ED NO: 1 93, SEQ ED NO: 1 94, SEQ ED NO: 1 95, SEQ ED NO: 1 96, SEQ ED 
NO: 1 97, SEQ ED NO: 1 98, SEQ ED NO: 1 99, SEQ ED NO:200, SEQ ED NO:201 , SEQ ID 
NO:202, SEQ ED NO:203, SEQ ED NO:204, SEQ ED NO:205, SEQ ED NO:206, SEQ ID 
NO:207, SEQ ED NO:208, SEQ ED NO:209, SEQ ED NO:210, SEQ ID NO:21 1, SEQ ID 
NO:212, SEQ ED NO:213, SEQ ID NO:214, SEQ IDNO:215, SEQ ID NO:216, SEQ ID 
NO:2 1 7, SEQ ID NO:2 1 8, SEQ ED NO:2 1 9, SEQ ID NO:220, SEQ ID NO:22 1 , SEQ ID 
NO:222, SEQ ED NO:223, SEQ ID NO:224, SEQ ED NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ID NO:228, SEQ ED NO:229, SEQ ED NO:230, SEQ ID NO:23 1 , SEQ ID 
NO:232, SEQ ID NO:233, SEQ ED NO:234, SEQ ED NO:235, SEQ ID NO:236, SEQ ID 
NO:237, SEQ ED NO:238, SEQ ED NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ 
ED NO:242, or the corresponding full-length amino acid sequence, or a functional 
derivative thereof. 

In preferred embodiments, the kinase polypeptide comprises an amino acid 
sequence having (a) an amino acid sequence selected from the group consisting of those 
set forth in SEQ ED NO:122, SEQ ED NO:123, SEQ ED NO:124, SEQ ED NO:125, SEQ 
ED NO: 126, SEQ ED NO: 127, SEQ ED NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ 
ED NO:l 3 1 , SEQ ID NO:l 32, SEQ ED NO: 1 33, SEQ ED NO: 1 34, SEQ ID NO: 1 35, SEQ 
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ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ 
ID N0:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ED NO:145, SEQ 
ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ 
ID NO:151, SEQ ID NO: 152, SEQ ID NO:153, SEQ ED NO:154, SEQ ID NO:155, SEQ 
5 ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ 
ID NO: 1 61 , SEQ ED NO: 1 62, SEQ ED NO: 1 63, SEQ ED NO: 1 64, SEQ ED NO: 1 65 . SEQ 
ID NO: 1 66, SEQ ID NO: 1 67, SEQ ED NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ 
ID NO: 171, SEQ ED NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ED NO:175, SEQ 
ID NO: 1 76, SEQ ID NO: 1 77, SEQ ED NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO:l 80, SEQ 

10 ID NO:l 81 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ 
ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ 
ID NO:191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ 
ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, SEQ ID NO:200, SEQ 
ID NO:201 , SEQ ID NO:202, SEQ ID NO:203, SEQ ED NO:204, SEQ ID NO:205, SEQ 

1 5 ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ED NO:209, SEQ ID NO:2 1 0, SEQ 

ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ 
ID NO:216, SEQ ID NO:217, SEQ ED NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ 
ID NO:221, SEQ ID NO:222, SEQ ED NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ 
ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ 

20 ID NO:231 , SEQ ID NO:232, SEQ ED NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ 
ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ 
ID NO:241, and SEQ ID NO:242; (b) an amino acid sequence selected from the group 
consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ 
ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ 

25 ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ 
ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ED NO: 138, SEQ ID NO: 139, SEQ 
IDNO:140, SEQ EDN0.141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ 
ID NO: 145, SEQ ID NO: 146, SEQ ED NO: 147, SEQ ID NO: 148, SEQ ED NO: 149, SEQ 
ID NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ 

30 ID NO: 155, SEQ ED NO: 156, SEQ ID NO: 157, SEQ ED NO: 158, SEQ ID NO: 159, SEQ 
ID NO: 1 60, SEQ ID NO: 1 61 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ 
ID NO:l 65. SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ 
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ID NO: 170, SEQ ED NO:171, SEQ ED NO: 172, SEQ ID N0.173, SEQ ID NO: 174, SEQ 
ID NO:175, SEQ EDN0.176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ 
ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ 
ID NO:185, SEQ ED NO:l 86, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ 
ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ 
ID NO: 195, SEQ ID NO: 196, SEQ ED NO: 197, SEQ ED NO: 198, SEQ ID NO: 199, SEQ 
ID NO:200, SEQ ED NO:201 , SEQ ED NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ 
ED NO:205, SEQ ED NO:206, SEQ ED NO.207, SEQ ED NO:208, SEQ ID NO:209, SEQ 
ID NO:210, SEQ ED NO:21 1, SEQ ED NO:212, SEQ ED NO:213, SEQ ED NO:214, SEQ 
ID NO:215, SEQ ED NO:216, SEQ ED NO:217, SEQ ED NO:218, SEQ ED NO:219, SEQ 
ID NO:220, SEQ ED NO:221, SEQ ED NO:222, SEQ ED NO:223, SEQ ED NO:224, SEQ 
ID NO:225, SEQ ED NO:226, SEQ ED NO:227, SEQ ID NO:228, SEQ ED NO:229, SEQ 
ID NO:230, SEQ ID NO:23 1 , SEQ ED NO:232, SEQ ED NO:233, SEQ ED NO:234, SEQ 
ED NO:235, SEQ ED NO:236, SEQ ID NO:237, SEQ ED NO:238, SEQ ID NO:239, SEQ 
ID NO:240, SEQ ED NO:241, and SEQ ID N0.242, except that it lacks one or more, but 
not all, of a domain selected from the group consisting of an N-terminal domain, a 
catalytic domain, a C-terminal domain, a coiled-coil structure region, a proline-rich region, 
a spacer region, an insert, and a C-terminal tail; (c) an amino acid sequence of a domain of 
a polypeptide selected from the group consisting of those set forth in SEQ ED NO: 122, 
SEQ ED NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, 
SEQ ED NO:128, SEQ ED NO:129, SEQ ED NO:130, SEQ ID NO:131, SEQ ID N0.132, 
SEQ ED NO: 133, SEQ ID NO: 134, SEQ ED NO: 135, SEQ ED NO: 136, SEQ ID NO: 137, 
SEQ ED NO: 138, SEQ ED NO: 139, SEQ ED NO: 140, SEQ ID NO: 141, SEQ ED NO: 142, 
SEQ ED NO: 143, SEQ ED NO: 144, SEQ ED NO: 145, SEQ ED NO: 146, SEQ ED NO: 147, 
SEQ ED NO: 1 48, SEQ ED NO: 1 49, SEQ ED NO: 1 50, SEQ ED NO: 1 5 1 , SEQ ED NO: 1 52, 
SEQ ED NO: 153, SEQ ED NO: 154, SEQ ED NO: 155, SEQ ED NO: 156, SEQ ED NO: 157, 
SEQ ED NO: 1 58, SEQ ED NO: 1 59, SEQ ED NO: 1 60, SEQ ED NO: 1 6 1 , SEQ ED NO: 1 62, 
SEQ ED NO: 163, SEQ ED NO: 164, SEQ ED NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, 
SEQ ED NO: 1 68, SEQ ED NO: 1 69, SEQ ID NO: 1 70, SEQ ED NO: 1 7 1 , SEQ ID NO: 1 72, 
SEQ ED NO: 1 73, SEQ ED NO: 1 74, SEQ ED NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, 
SEQ ED NO:l 78, SEQ ED NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 81 , SEQ ID NO: 1 82, 
SEQ ID NO:183, SEQ ID NO: 184, SEQ ED NO:185, SEQ ED NO: 186, SEQ ED NO: 187, 
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SEQ ID NO: 188, SEQ ID NO:189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, 
SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, 
SEQ ID NO: 198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, 
SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, 
SEQ ID NO:208, SEQ ED NO:209, SEQ ID NO.210, SEQ ID NO:21 1, SEQ ID NO:212, 
SEQ ID NO:213, SEQ IDNO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, 
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, 
SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, 
SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, 
SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, 
SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ ID 
NO:242 where the domain is selected from the group consisting of an N-termina] domain, 
a catalytic domain, a C-terminal domain, a coiled-coil structure region, a proline-rich 
region, a spacer region, an insert, and a C-terminal tail; or (d) an amino acid sequence 
selected from the group consisting of those set forth in SEQ ID NO: 122, SEQ ID NO: 123, 
SEQ ID NO: 1 24, SEQ ID NO: 1 25, SEQ ID NO: 1 26, SEQ ID NO: 1 27, SEQ ID NO: 1 28, 
SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, 
SEQ ID NO: 1 34, SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: 1 38, 
SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, 
SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, 
SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53, 
SEQ ID NO:l 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID NO:l 57, SEQ ID NO: 1 58, 
SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, 
SEQ ID NO: 1 64, SEQ ID NO: 1 65 . SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, 
SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ID NO: 173, 
SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, 
SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ DD NO: 1 81 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, 
SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, 
SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, 
SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, 
SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, 
SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, 
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SEQ ID NO:209, SEQ ID NO:210, SEQ ID N0:2U, SEQ ID NO:212, SEQ ID NO:213, 
SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID N0:21 7, SEQ ID NO:218, 
SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID N0.222, SEQ ID NO:223, 
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, 
SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, 
SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, 
SEQ ID NO:239, SEQ ID NO:240, SEQ ID N0.241 , and SEQ ID NO:242, except that it 
lacks one or more, but not all, of the domains selected from the group consisting of a C- 
terminal domain, a catalytic domain, an N-terminal domain, a spacer region, a proline-rich 
region, a coiled-coil structure region, an insert, and a C-terminal tail. (The domain 
demarcations of the polypeptides of the invention are indicated in Table 2 by reference to 
the kinase domain.) 

The polypeptide can be isolated from a natural source by methods well-known in 
the art. The natural source may be mammalian, preferably human, blood, semen, or tissue, 
and the polypeptide may be synthesized using an automated polypeptide synthesizer. The 
isolated, enriched, or purified kinase polypeptide is preferably selected from the group 
consisting of those set forth in SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ 
ID NO:125, SEQ ID NO:126, SEQ ED NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ 
ID NO:l 30, SEQ ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID NO: 1 33, SEQ ID NO: 1 34, SEQ 
ID NO:l 35, SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: 1 38, SEQ ID NO: 139, SEQ 
ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ 
ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ 
ID NO:l 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, SEQ 
IDNO:155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ 
ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO. 163, SEQ ID NO: 164, SEQ 
ID NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ 
ID NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ 
IDNO:175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 178, SEQ ID NO: 179, SEQ 
ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ 
ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ 
ID NO: 190, SEQ ID NO: 191, SEQ ED NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ 
ID NO: 1 95, SEQ ED NO: 1 96, SEQ ED NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, SEQ 
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ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ 
ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ 
ID NO:210, SEQ ID N0:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ 
ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ 
5 ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ 

ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ 
ID NO:230, SEQ ID NO:231 , SEQ ID NO:232, SEQ ID NO:233, SEQ ID 1M0:234, SEQ 
ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ 
ID NO:240, SEQ ID NO:241, and SEQ ID NO:242A. 
]0 In some embodiments the invention includes a recombinant kinase polypeptide 

selected from the group consisting of SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, 
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SEQ ID NO:230, SEQ ID NO:23 1 , SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, 
SEQ ED NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, 
SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242. By "recombinant kinase 
polypeptide" is meant a polypeptide produced by recombinant DNA techniques such that 
it is distinct from a naturally occurring polypeptide either in its location (e.g., present in a 
different cell or tissue than found in nature), purity or structure. Generally, such a 
recombinant polypeptide will be present in a cell in an amount different from that 
normally observed in nature. 

- In a -fifth aspect, the invention features an antibody (e.g., a monoclonal or 

polyclonal antibody) having specific binding affinity to a kinase polypeptide or a kinase 
polypeptide domain or fragment where the polypeptide is selected from the group 
consisting of SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ 
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ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ 
ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ 
ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ 
ID NO:241 , and SEQ ID NO:242. In preferred embodiments, the antibody binds 

5 specifically to domains of kinase polypeptides, that are defined supra. 

By "specific binding affinity" is meant that the antibody binds to the target kinase 
polypeptide with greater affinity than it binds to other polypeptides under specified 
conditions. Antibodies or antibody fragments are polypeptides that contain regions that 
can bind other polypeptides. The term "specific binding affinity" describes an antibody 

10 that binds to a kinase polypeptide with greater affinity than it binds to other polypeptides 

under specified conditions. 

The term "polyclonal" refers to antibodies that are heterogenous populations of 
antibody molecules derived from the sera of animals immunized with an antigen or an 
antigenic functional derivative thereof. For the production of polyclonal antibodies, 

1 5 various host animals may be immunized by injection with the antigen. Various adjuvants 

may be used to increase the immunological response, depending on the host species. 

"Monoclonal antibodies" are substantially homogenous populations of antibodies 
to a particular antigen. They may be obtained by any technique which provides for the 
production of antibody molecules by continuous cell lines in culture. Monoclonal 

20 antibodies may be obtained by methods known to those skilled in the art (Kohler ei al. y 

Nature 256:495-497, 1975, and U.S. Patent No. 4,376,1 10, both of which are hereby 
incorporated by reference herein in their entirety including any figures, tables, or 
drawings). 

The term "antibody fragment" refers to a portion of an antibody, often the hyper 
25 variable region and portions of the surrounding heavy and light chains, that displays 

specific binding affinity for a particular molecule. A hyper variable region is a portion of 
an antibody that physically binds to the polypeptide target. 

Antibodies or antibody fragments having specific binding affinity to a kinase 
polypeptide or domains of a kinase polypeptide of the invention may be used in methods 
30 for detecting the presence and/or amount of kinase polypeptide in a sample by probing the 

sample with the antibody under conditions suitable for kinase-antibody immunocomplex 
formation and detecting the presence and/or amount of the antibody conjugated to the 



WO 00/73469 



PCT/US00/14842 



54 

kinase polypeptide. Diagnostic kits for performing such methods may be constructed to 
include antibodies or antibody fragments specific for the kinase as well as a conjugate of a 
binding partner of the antibodies or the antibodies themselves. 

An antibody or antibody fragment with specific binding affinity to a kinase 
5 polypeptide of the invention can be isolated, enriched, or purified from a prokaryotic or 

eukaryotic organism. Routine methods known to those skilled in the art enable production 
of antibodies or antibody fragments, in both prokaryotic and eukaryotic organisms. 
Purification, enrichment, and isolation of antibodies, which are polypeptide molecules, are 
described above. 

10 Antibodies having specific binding affinity to a kinase polypeptide of the invention 

may be used in methods for detecting the presence and/or amount of kinase polypeptide in 
a sample by contacting the sample with the antibody under conditions such that an 
immunocomplex forms and detecting the presence and/or amount of the antibody 
conjugated to the kinase polypeptide. Diagnostic kits for performing such methods may 

1 5 be constructed to include a first container containing the antibody and a second container 
having a conjugate of a binding partner of the antibody and a label, such as, for example, a 
radioisotope. The diagnostic kit may also include notification of an FDA approved use 
and instructions therefor. 

In a sixth aspect, the invention features a hybridoma which produces an antibody 

20 having specific binding affinity to a kinase polypeptide or a kinase polypeptide domain, 
where the polypeptide is selected from the group consisting of SEQ ID NO: 122, SEQ ID 
NO:123, SEQ ID NO:124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID 
NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID 
NO: 1 33, SEQ ID NO: 1 34, SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID 

25 NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ED NO: 142, SEQ ID 
NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ED NO:146, SEQ ID NO:147, SEQ ID 
NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID 
NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID 
NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID 

30 NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID 
NO: 168, SEQ ED NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID 
NO: 173, SEQ IDNO:174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID 
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NO: 178, SEQ ID NO: 179, SEQ ID NO: 180, SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID 
NO: 183, SEQ ID NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID NO: 187, SEQ ID 
NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ ID NO: 1 99, SEQ ID 
NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID 

5 NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID 
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ED NO:207, SEQ ID 
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID 
NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID 
NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID 

10 NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID 
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID 
NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID 
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242; and 
where the domains are defined as above. By "hybridoma" is meant an immortalized cell 

1 5 line that is capable of secreting an antibody, for example an antibody to a kinase of the 

invention. In preferred embodiments, the antibody to the kinase comprises a sequence of 
amino acids that is able to specifically bind a kinase polypeptide of the invention. 



In a seventh aspect, the invention features a kinase polypeptide binding agent able 
to bind to a kinase polypeptide selected from the group consisting of SEQ ID NO: 122, 
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SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ ID NO: 1 99, 
SEQ ID NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO:l 95, SEQ ID NO: 1 96, SEQ ID NO: 1 97, 
SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, 
SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, 

5 SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, 

SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, 
SEQ ID NO:21 8, SEQ ID NO:2 1 9, SEQ ID NO:220, SEQ ID NO:22 1 , SEQ ID NO:222, 
SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, 
SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, 

10 SEQ ED NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO.237, 

SEQ ID NO:238, SEQ ID NO:239, SEQ ED NO:240, SEQ ID NO:241, and SEQ ID 
NO:242. The binding agent is preferably a purified antibody that recognizes an epitope 
present on a kinase polypeptide of the invention. Other binding agents include molecules 
that bind to kinase polypeptides and analogous molecules that bind to a kinase 

1 5 polypeptide. Such binding agents may be identified by using assays that measure kinase 

binding partner activity, such as those that measure PDGFR activity. 

The invention also features a method for screening for human cells containing a 
kinase polypeptide of the invention or an equivalent sequence. The method involves 
identifying the novel polypeptide in human cells using techniques that are routine and 

20 standard in the art, such as those described herein for identifying the kinases of the 

invention (e.g., cloning, Southern or Northern blot analysis, in situ hybridization, PCR 
amplification, etc.). 

In an eighth aspect, the invention features methods for identifying a substance that 
modulates kinase activity comprising the steps of: (a) contacting a kinase polypeptide 

25 selected from the group consisting of SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, 
SEQ ID NO: 1 25, SEQ ID NO: 1 26, SEQ ED NO: 1 27, SEQ ID NO: 1 28, SEQ ID NO: 1 29, 
SEQ ID NO: 1 30, SEQ ED NO: 131, SEQ ED NO: 1 32, SEQ ID NO: 1 33, SEQ ID NO: 1 34, 
SEQ ED NO: 1 35, SEQ ED NO: 136, SEQ ID NO: 137, SEQ ID NO: 1 38, SEQ ID NO: 1 39, 
SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, 

30 SEQ ID NO:145, SEQ ED NO:146, SEQ ED NO:147, SEQ ID NO:148, SEQ ID NO:149, 
SEQ ED NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, 
SEQ ED NO: 155, SEQ ED NO: 156, SEQ ED NO:157, SEQ ID NO: 158, SEQ ID NO: 159, 
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SEQ ID NO: 1 60, SEQ ID NO: 161, SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, 
SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, 
SEQ ID NO: 1 70, SEQ ID NO: 1 71 , SEQ ID NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, 
SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, 
SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, 
SEQ E> NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 188, SEQ ID NO: 1 89, 
SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ ID NO: 1 99, SEQ ID NO: 1 93, SEQ ID NO: 1 94, 
SEQ ID NO: 1 95 , SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, 
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, 
SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, 
SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, 
SEQ ID NO:21 5, SEQ ID NO:216, SEQ ID NO:21 7, SEQ ID NO:21 8, SEQ ID NO:21 9, 
SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, 
SEQ ID NO:225, SEQ ED NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, 
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, 
SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, 
SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242 with a test substance; (b) 
measuring the activity of said polypeptide; and (c) determining whether said substance 
modulates the activity of said polypeptide. 

The term "modulates" refers to the ability of a compound to alter the function of a 
kinase of the invention. A modulator preferably activates or inhibits the activity of a 
kinase of the invention. 

The term "activates" refers to increasing the cellular activity of the kinase. The 
term inhibit refers to decreasing the cellular activity of the kinase. Kinase activity is 
preferably the interaction with a natural binding partner. 

The term "modulates" also refers to altering the function of kinases of the 
invention by increasing or decreasing the probability that a complex forms between the 
kinase and a natural binding partner. A modulator preferably increases the probability that 
such a complex forms between the kinase and the natural binding partner, more preferably 
increases or decreases the probability that a complex forms between the kinase and the 
natural binding partner depending on the concentration of the compound exposed to the 
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kinase, and most preferably decreases the probability that a complex forms between the 
kinase and the natural binding partner. 

The term "complex" refers to an assembly of at least two molecules bound to one 
another. Signal transduction complexes often contain at least two protein molecules 
5 bound to one another. For instance, a protein tyrosine receptor protein kinase, GRB2, 

SOS, RAF, and RAS assemble to form a signal transduction complex in response to a 
mitogenic ligand. 

The term "natural binding partner" refers to polypeptides, lipids, small molecules, 
or nucleic acids that bind to kinases in cells. A change in the interaction between a kinase 

10 and a natural binding partner can manifest itself as an increased or decreased probability 

that the interaction forms, or an increased or decreased concentration of kinase/natural 
binding partner complex. 

The term "contacting" as used herein refers to mixing a solution comprising the 
test compound with a liquid medium bathing the cells of the methods. The solution 

15 comprising the compound may also comprise another component, such as dimethyl 

sulfoxide (DMSO), which facilitates the uptake of the test compound or compounds into 
the cells of the methods. The solution comprising the test compound may be added to the 
medium bathing the cells by utilizing a delivery apparatus, such as a pipet-based device or 
syringe-based device. 

20 In a ninth aspect, the invention features methods for identifying a substance that 

modulates kinase activity in a cell comprising the steps of: (a) expressing a kinase 
polypeptide in a cell, wherein said polypeptide is selected from the group consisting of 
SEQ ED NO: 1 22, SEQ ID NO: 1 23, SEQ ID NO: 1 24, SEQ ID NO: 1 25, SEQ ID NO: 1 26, 
SEQ ED NO: 1 27, SEQ ID NO: 1 28, SEQ ED NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , 

25 SEQ ID NO: 1 32, SEQ ID NO: 1 33, SEQ ID NO: 1 34, SEQ ID NO: 1 35, SEQ ID NO: 1 36, 

SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, 
SEQ ID NO: 142, SEQ ED NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, 
SEQ ID NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , 
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, 

30 SEQ ID NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 6 1 , 
SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ED NO: 1 65 . SEQ ID NO: 1 66, 
SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ED NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , 
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and SEQ ID NO:242; (b) adding a test substance to said cell; and (c) monitoring a change 
in cell phenotype or the interaction between said polypeptide and a natural binding 
partner. 

The term "expressing" as used herein refers to the production of kinases of the 
invention from a nucleic acid vector containing kinase genes within a cell. The nucleic 
acid vector is transfected into cells using well known techniques in the art as described 
herein. 

In a tenth aspect, the invention provides methods for treating a disease or abnormal 
condition by administering to a patient in need of such treatment a substance that 
modulates the activity of a polypeptide selected from the group consisting of SEQ ID 
NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID 
NO: 1 27, SEQ ID NO: 1 28, SEQ ID NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID 
NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID 
NO: 1 37, SEQ ID NO: 1 38, SEQ ID NO: 1 39, SEQ ID NO: 1 40, SEQ ID NO: 1 41 , SEQ ID 
NO: 1 42, SEQ ID NO: 1 43, SEQ ID NO: 1 44, SEQ ID NO: 1 45, SEQ ID NO: 1 46, SEQ ID 
NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID 
NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID 
NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 61 , SEQ ID 
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NO: 1 62, SEQ ID NO: 1 63, SEQ ED NO: 1 64, SEQ ID NO: 1 65 . SEQ ID NO: 1 66, SEQ ID 
NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID 
NO: 172, SEQ ID NO: 173, SEQ ID NO:l 74, SEQ ID NO: 175, SEQ ED NO: 176, SEQ ID 
NO: 1 77, SEQ ED NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ED NO: 1 8 1 , SEQ ID 
5 NO: 182, SEQ ID NO: 183, SEQ ID NO: 184, SEQ ED NO: 185, SEQ ID NO: 186, SEQ ID 
NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ ID 
NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID 
NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201 , SEQ ID 
NO:202, SEQ ED NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 

10 NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID 
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID 
NO:217, SEQ ED NO:23 8, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ED NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID 

15 NO:232, SEQ ED NO:233, SEQ ED NO:234, SEQ ID NO:235, SEQ ED NO:236, SEQ ID 
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ED NO:241, and SEQ 
ID NO:242. Preferably, the disease is selected from the group consisting of immune- 
related diseases and disorders, cardiovascular disease, neurodegenerative disorders, and 
cancer. Also included are metabolic disorders, such as diabetes mellitus, and reproductive 

20 disorders, such as infertility. 

Preferably, the disease or disorder is selected from the group consisting of 
rheumatoid arthritis, artherosclerosis, autoimmune disorders, and organ transplantation. 
Preferably the disease or disorder is selected from the group consisting of immune-related 
diseases and disorders, myocardial infarction, cardiomyopathies, stroke, renal failure, and 

25 oxidative stress-related neurodegenerative disorders. Most preferably, the immune-related 

diseases and disorders are selected from the group consisting of rheumatoid arthritis, 
chronic inflammatory bowel disease, chronic inflammatory pelvic disease, multiple 
sclerosis, asthma, osteoarthritis, psoriasis, atherosclerosis, rhinitis, autoimmunity, and 
organ transplantation. 

30 Substances useful for treatment of disorders or diseases preferably show positive 

results in one or more in vitro assays for an activity corresponding to treatment of the 
disease or disorder in question Substances that modulate the activity of the polypeptides 
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preferably include, but are not limited to, antisense oligonucleotides and inhibitors of 
protein kinases. 

The term "preventing" refers to decreasing the probability that an organism 
contracts or develops an abnormal condition. 
5 The term "treating" refers to having a therapeutic effect and at least partially 

alleviating or abrogating an abnormal condition in the organism. 

The term "therapeutic effect" refers to the inhibition or activation factors causing 
or contributing to the abnormal condition. A therapeutic effect relieves to some extent one 
or more of the symptoms of the abnormal condition. In reference to the treatment of 
10 abnormal conditions, a therapeutic effect can refer to one or more of the following: (a) an 

increase in the proliferation, growth, and/or differentiation of cells; (b) inhibition (i.e., 
slowing or stopping) of cell death; (c) inhibition of degeneration; (d) relieving to some 
extent one or more of the symptoms associated with the abnormal condition; and (e) 
enhancing the function of the affected population of cells. Compounds demonstrating 
15 efficacy against abnormal conditions can be identified as described herein. 

The term "abnormal condition" refers to a function in the cells or tissues of an 
organism that deviates from their normal functions in that organism. An abnormal 
condition can relate to cell proliferation, cell differentiation or cell survival. An abnormal 
condition may also include irregularities in cell cycle progression, i.e., irregularities in 
20 normal cell cycle progression through mitosis and meiosis. 

Abnormal cell proliferative conditions include cancers such as fibrotic and 
mesangial disorders, abnormal angiogenesis and vasculogenesis, wound healing, psoriasis, 
diabetes mellitus, and inflammation. 

Abnormal differentiation conditions include, but are not limited to 
25 neurodegenerative disorders, slow wound healing rates, and slow tissue grafting healing 

rates. 

Abnormal cell survival conditions relate to conditions in which programmed cell 
death (apoptosis) pathways are activated or abrogated. A number of protein kinases are 
associated with the apoptosis pathways. Aberrations in the function of any one of the 
30 protein kinases could lead to cell immortality or premature cell death. 
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The term "aberration", in conjunction with the function of a kinase in a signal 
transduction process, refers to a kinase that is over- or under-expressed in an organism, 
mutated such that its catalytic activity is lower or higher than wild-type protein kinase 
activity, mutated such that it can no longer interact with a natural binding partner, is no 
longer modified by another protein kinase or protein phosphatase, or no longer interacts 
with a natural binding partner. 

The term "administering" relates to a method of incorporating a compound into 
cells or tissues of an organism. The abnormal condition can be prevented or treated when 
the cells or tissues of the organism exist within the organism or outside of the organism. 
Cells existing outside the organism can be maintained or grown in cell culture dishes. For 
cells harbored within the organism, many techniques exist in the art to administer 
compounds, including (but not limited to) oral, parenteral, dermal, injection, and aerosol 
applications. For cells outside of the organism, multiple techniques exist in the art to 
administer the compounds, including (but not limited to) cell microinjection techniques, 
transformation techniques, and carrier techniques. 

The abnormal condition can also be prevented or treated by administering a 
compound to a group of cells having an aberration in a signal transduction pathway to an 
organism. The effect of administering a compound on organism function can then be 
monitored. The organism is preferably a mouse, rat, rabbit, guinea pig, or goat, more 
preferably a monkey or ape, and most preferably a human. 

In an eleventh aspect, the invention features methods for detection the expression 
of a polypeptide in a sample as a diagnostic tool for diseases or disorders, wherein the 
method comprises the steps of: (a) contacting the sample with a nucleic acid probe which 
hybridizes under hybridization assay conditions to a nucleic acid target region of a kinase 
polypeptide selected from the group consisting of SEQ ID NO:122, SEQ ID NO:123, SEQ 
ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ED NO: 128, SEQ 
ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ 
ID NO: 1 34, SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: 138, SEQ 
ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ 
ID NO: 144, SEQ ED NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ 
ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ 
ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID NO: 157, SEQ ID NO: 158, SEQ 
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ID NO: 1 59, SEQ ED NO: 1 60, SEQ ID NO: 1 61 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ 
ID NO: 1 64, SEQ ED NO: 1 65. SEQ ED NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ 
ID NO: 169, SEQ DD NO: 170, SEQ ED NO: 171, SEQ ID NO: 172, SEQ ID N0.173, SEQ 
ED NO: 1 74, SEQ ID NO: 1 75, SEQ ED NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ 
5 ID NO: 179, SEQ ID NO: 1 80, SEQ ED NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ 
ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO:l 88, SEQ 
ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ ID NO: 1 99, SEQ ID NO: 1 93, SEQ 
ID NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ 
ID NO:l 99, SEQ ED NO:200, SEQ ID NO:201 , SEQ ID NO:202, SEQ ID NO:203, SEQ 

1 0 ED NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ 

ID NO:209, SEQ ED NO:2 1 0, SEQ ID NO:2 1 1 , SEQ ID NO:2 1 2, SEQ ID NO:2 1 3, SEQ 
ID NO:2 1 4, SEQ ID NO:2 1 5, SEQ ED NO:2 1 6, SEQ ED NO:2 1 7, SEQ ED NO:2 1 8, SEQ 
ID NO:219, SEQ ID NO:220, SEQ ID NO:221 , SEQ ID NO:222, SEQ ID NO:223, SEQ 
ID NO:224, SEQ ID NO:225, SEQ ED NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ 

15 ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ 
ED NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ 
ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ ED NO:242, said probe 
comprising the nucleic acid sequence encoding the polypeptide, fragments thereof, and the 
complements of the sequences and fragments; and (b) detecting the presence or amount of 

20 the probertarget region hybrid as an indication of the disease. 

In preferred embodiments of the invention, the disease or disorder is selected from 
the group consisting of rheumatoid arthritis, artherosclerosis, autoimmune disorders, organ 
transplantation, myocardial infarction, cardiomyopathies, stroke, renal failure, oxidative 
stress-related neurodegenerative disorders, metabolic disorder including diabetes, 

25 reproductive disorders including infertility, and cancer. 

The kinase "target region" is a nucleotide base sequence selected from the group 
consisting of those set forth in SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID 
NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ 
ID NO:10, SEQ ID NO:l 1, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID 

30 NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:20, 
SEQ ID NO:21 , SEQ ID NO:22, SEQ ID NO:23, SEQ ED NO:24, SEQ ID NO:25, SEQ 
ID NO:26, SEQ ID N0.27, SEQ ED NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ED 
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NO:31 , SEQ ID NO:32, SEQ ID NO:33, SEQ CD NO:34, SEQ ID NO:35, SEQ ID NO:36, 
SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID N0:41, SEQ 
ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID 
NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID N0:51, SEQ ID NO:52. 
5 SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ 

ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID N0:61 , SEQ ED NO:62, SEQ ID 
NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, 
SEQ ID NO:69, SEQ ID NO:70, SEQ ID N0:71 , SEQ ID NO:72, SEQ ID NO:73, SEQ 
ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID 

10 NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ED NO:82, SEQ ID NO:83, SEQ ID NO:84, 

SEQ ID NO:85, SEQ ED NO:86, SEQ ED NO:87, SEQ DD NO:88, SEQ ID NO:89, SEQ 
ID NO:90, SEQ DD N0:91, SEQ DD NO:92, SEQ DD NO:93, SEQ DD NO:94, SEQ DD 
NO:95, SEQ DD NO:96, SEQ ID NO:97, SEQ DD NO:98, SEQ ID NO:99, SEQ DD 
NO: 100, SEQ DD NO: 101, SEQ DD NO: 102, SEQ ID NO: 103, SEQ ID NO: 104, SEQ ID 

15 NO: 105, SEQ DD NO: 106, SEQ DD NO: 107, SEQ DD NO: 108, SEQ ID NO: 109, SEQ ID 
NO:l 10, SEQ DD NO: 1 1 1, SEQ DD NO:l 12, SEQ DD NO:l 13, SEQ ID NO:l 14, SEQ ID 
NO: 1 1 5, SEQ DD NO: 1 1 6, SEQ ID NO: 1 1 7, SEQ ID NO: 1 1 8, SEQ ID NO: 1 1 9, SEQ ID 
NO: 120, and SEQ DD NO: 121 , or the corresponding full-length sequences, a functional 
derivative thereof, or a fragment thereof to which the nucleic acid probe will specifically 

20 hybridize. Specific hybridization indicates that in the presence of other nucleic acids the 

probe only hybridizes detectably with the kinase of the invention's target region. Putative 
target regions can be identified by methods well known in the art consisting of alignment 
and comparison of the most closely related sequences in the database. 

In preferred embodiments the nucleic acid probe hybridizes to a kinase target 

25 region encoding at least 6, 12, 75, 90, 105, 120, 150, 200, 250, 300 or 350 contiguous 

amino acids of the sequence set forth in SEQ DD NO: 122, SEQ DD NO: 123, SEQ ED 
NO:124, SEQ ID NO: 125, SEQ DD NO:126, SEQ DDNO:127, SEQ ID N0.128, SEQ ID 
NO:129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID 
NO: 134, SEQ ID NO: 135, SEQ DD NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID 

30 NO: 139, SEQ DD NO: 140, SEQ ID NO: 141, SEQ DD NO:142, SEQ ID NO: 143, SEQ ID 
NO: 1 44, SEQ ID NO: 1 45, SEQ DD NO: 1 46, SEQ ID NO: 1 47, SEQ ID NO: 1 48, SEQ ID 
NO: 1 49, SEQ DD NO: 1 50, SEQ DD NO: 1 5 1 , SEQ DD NO: 1 52, SEQ DD NO: 1 53, SEQ ID 
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NO: 154, SEQ ID NO: 155, SEQ ED NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID 
NO: 159, SEQ ID NO: 160, SEQ ED NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ED 
NO: 1 64, SEQ ID NO: 1 65. SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID 
NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID 

5 NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID 
NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID 
NO: 184, SEQ ED NO: 185, SEQ ID NO: 186, SEQ ID NO: 187, SEQ ID NO: 188, SEQ ID 
NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID 
NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ ED NO: 1 97, SEQ ID NO: 1 98, SEQ ID 

10 NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID 
NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID 
NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID 
NO:214, SEQ ID NO:215, SEQ ED NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID 
NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID 

1 5 NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID 
NO:229, SEQ EDNO:230, SEQ ED NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID 
NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID 
NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, or the corresponding 
full-length amino acid sequence, or a functional derivative thereof. Hybridization 

20 conditions should be such that hybridization occurs only with the kinase genes in the 

presence of other nucleic acid molecules. Under stringent hybridization conditions only 
highly complementary nucleic acid sequences hybridize. Preferably, such conditions 
prevent hybridization of nucleic acids having more than 1 or 2 mismatches out of 20 
contiguous nucleotides. Such conditions are defined supra. 

25 Hybridization conditions should be such that hybridization occurs only with the 

genes in the presence of other nucleic acid molecules. Under stringent hybridization 
conditions only highly complementary nucleic acid sequences hybridize. Preferably, such 
conditions prevent hybridization of nucleic acids having 1 or 2 mismatches out of 20 
contiguous nucleotides. Such conditions are defined supra. 

30 The diseases for which detection of kinase genes in a sample could be diagnostic 

include diseases in which kinase nucleic acid (DNA and/or RNA) is amplified in 
comparison to normal cells. By "amplification" is meant increased numbers of kinase 
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DNA or RNA in a cell compared with normal cells. In normal cells, kinases are typically 
found as single copy genes. In selected diseases, the chromosomal location of the kinase 
genes may be amplified, resulting in multiple copies of the gene, or amplification. Gene 
amplification can lead to amplification of kinase RNA, or kinase RNA can be amplified in 

5 the absence of kinase DNA amplification. 

"Amplification" as it refers to RNA can be the detectable presence of kinase RNA 
in cells, since in some normal cells there is no basal expression of kinase RNA. In other 
normal cells, a basal level of expression of kinase exists, therefore in these cases 
amplification is the detection of at least 1-2-fold, and preferably more, kinase RNA, 

1 0 compared to the basal level. 

The diseases that could be diagnosed by detection of kinase nucleic acid in a 
sample preferably include cancers. The test samples suitable for nucleic acid probing 
methods of the present invention include, for example, cells or nucleic acid extracts of 
cells, or biological fluids. The samples used in the above-described methods will vary 

15 based on the assay format, the detection method and the nature of the tissues, cells or 

extracts to be assayed. Methods for preparing nucleic acid extracts of cells are well 
known in the art and can be readily adapted in order to obtain a sample that is compatible 
with the method utilized. 

Another aspect of the invention involves a method of agonizing (stimulating) or 

20 antagonizing a target of the invention and a natural binding partner associated activity in a 

mammal comprising administering to said mammal an agonist or antagonist to one of the 
above disclosed polypeptides in an amount sufficient to effect said agonism or 
antagonism. A method of treating diseases in a mammal with an agonist or antagonist of 
the protein of the present invention activity comprising administering the agonist or 

25 antagonist to a mammal in an amount sufficient to agonize or antagonize associated 

functions is also encompassed in the present application. 

In an effort to discover novel treatments for diseases, biomedical researchers and 
chemists have designed, synthesized, and tested molecules that inhibit the function of 
protein polypeptides. Some small organic molecules form a class of compounds that 

30 modulate the function of protein polypeptides. Examples of molecules that have been 
reported to inhibit the function of protein kinases include, but are not limited to, bis 
monocyclic, bicyclic or heterocyclic aryl compounds (PCT WO 92/20642, published 
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November 26, 1992 by Maguire et al), vinylene-azaindole derivatives (PCT WO 
94/14808, published July 7, 1994 byBallinari et al), l-cyclopropyl-4-pyridyl-quinolones 
(U.S. Patent No. 5,330,992), styryl compounds (U.S. Patent No. 5,217,999), styryl- 
substituted pyridyl compounds (U.S. Patent No. 5,302,606), certain quinazoline 
5 derivatives (EP Application No. 0 566 266 Al), seleoindoles and selenides (PCT WO 

94/03427, published February 17, 1994 by Denny et al.), tricyclic polyhydroxylic 
compounds (PCT WO 92/21660, published December 10, 1992 by Dow), and 
benzylphosphonic acid compounds (PCT WO 91/15495, published October 17, 1991 by 
Dow et al), all of which are incorporated by reference herein, including any drawings. 

1 0 Compounds that can traverse cell membranes and are resistant to acid hydrolysis 

are potentially advantageous as therapeutics as they can become highly bioavailable after 
being administered orally to patients. However, many of these protein inhibitors only 
weakly inhibit function. In addition, many inhibit a variety of protein kinases and will 
therefore cause multiple side-effects as therapeutics for diseases. 

15 Some indolinone compounds, however, form classes of acid resistant and 

membrane permeable organic molecules. WO 96/22976 (published August 1, 1996 by 
Ballinari et al) describes hydrosoluble indolinone compounds that harbor tetralin, 
naphthalene, quinoline, and indole substituents fused to the oxindole ring. These bicyclic 
substituents are in turn substituted with polar groups including hydroxylated alkyl, 

20 phosphate, and ether substituents. U.S. Patent Application Serial Nos. 08/702,232, filed 

August 23, 1996, entitled "Indolinone Combinatorial Libraries and Related Products and 
Methods for the Treatment of Disease" by Tang et al. (Lyon & Lyon Docket No. 
221/187) and 08/485,323, filed June 7, 1995, entitled "Benzylidene-Z-Indoline 
Compounds for the Treatment of Disease" by Tang et al. (Lyon & Lyon Docket No. 

25 223/298) and International Patent Publication WO 96/22976, published August 1 , 1 996 by 
Ballinari et al, all of which are incorporated herein by reference in their entirety, 
including any drawings, describe indolinone chemical libraries of indolinone compounds 
harboring other bicyclic moieties as well as monocyclic moieties fused to the oxindole 
ring. Applications 08/702,232, filed August 23, 1996, entitled "Indolinone Combinatorial 

30 Libraries and Related Products and Methods for the Treatment of Disease" by Tang et al 
(Lyon & Lyon Docket No. 221/187), 08/485,323, filed June 7, 1995, entitled 
"Benzylidene-Z-Indoline Compounds for the Treatment of Disease" by Tang et al. (Lyon 
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& Lyon Docket No. 223/298), and WO 96/22976, published August 1, 1996 by Ballinari 
et al. teach methods of indolinone synthesis, methods of testing the biological activity of 
indolinone compounds in cells, and inhibition patterns of indolinone derivatives, both of 
which are incorporated by reference herein, including any drawings. 
5 Other examples of substances capable of modulating kinase activity include, but 

are not limited to, tyrphostins, quinazolines, quinoxolines, and quinolines. The 
quinazohnes, tyrphostins, quinolines, and quinoxolines referred to above include well 
known compounds such as those described in the literature. For example, representative 
publications describing quinazolines include Barker et al., EPO Publication No. 0 520 722 

10 Al; Jones et al., U.S. Patent No. 4,447,608; Kabbe et al., U.S. Patent No. 4,757,072; Kaui 

and Vougioukas, U.S. Patent No. 5, 316,553; Kreighbaum and Comer, U.S. Patent No. 
4,343,940; Pegg and Wardleworth, EPO Publication No. 0 562 734 Al ; Barker et al., Proc. 
of Am. Assoc. for Cancer Research 32:327 (1991); Bertino, J.R., Cancer Research 3:293- 
304 (1979); Bertino, J.R., Cancer Research 9(2 part l):293-304 (1979); Curtin et al., Br. J. 

1 5 Cancer 53:361-368 (1986); Fernandes et al., Cancer Research 43:1 1 1 7-1 123 (1983); Ferris 

et al. J. Org. Chem. 44(2): 173-1 78; Fry et al., Science 265:1093-1095 (1994); Jackman et 
al., Cancer Research 5 1 :5579-5586 (1 98 1 ); Jones et al. J. Med. Chem. 29(6): 1 114-1118; 
Lee and Skibo. Biochemistry 26(23):7355-7362 (1987); Lemus et al.. J. Org. Chem. 
54:351 1-3518 (1989); Ley and Seng, Synthesis 1975:415-522 (1975); Maxwell et a]., 

20 Magnetic Resonance in Medicine 17:189-196 (1991); Mini et al.. Cancer Research 

45:325-330 (1985); Phillips and Castle, J. Heterocyclic Chem. 17(19): 1489-1 596 (1980); 
Reece et al., Cancer Research 47(1 1):2996-2999 (1977); Sculier et al., Cancer Immunol, 
and lmmunother. 23:A65 (1986); Sikora et al., Cancer Letters 23:289-295 (1984); and 
Sikora et al., Analytical Biochem. 172:344-355 (1988), all of which are incorporated 

25 herein by reference in their entirety, including any drawings. 

Quinoxaline is described in Kaul and Vougioukas, U.S. Patent No. 5,316,553, 
incorporated herein by reference in its entirety, including any drawings. 

Quinolines are described in Dolle et al., J. Med. Chem. 37:2627-2629 (1994); 
MaGuire, J. Med. Chem. 37:2129-2131 (1994); Burke et al., J. Med. Chem. 36:425-432 

30 (1993); and Burke et al. BioOrganic Med. Chem. Letters 2:1771-1774 (1992), all of which 
are incorporated by reference in their entirety, including any drawings. 



WO 00/73469 



69 



PCT/US00/14842 



Tyrphostins are described in Allen et al., Clin. Exp. Immunol. 91:141-156 (1993); 
Anafi et al., Blood 82:12:3524-3529 (1993); Baker et al., J. Cell Sci. 102:543-555 (1992); 
Bilder et al., Amer. Physiol. Soc. pp. 6363-6143:C721-C730 (1991); Brunton et al., 
Proceedings of Amer. Assoc. Cancer Rsch. 33:558 (1992); Bryckaert et al., Experimental 

5 Cell Research 199:255-261 (1992); Dong et al., J. Leukocyte Biology 53:53-60 (1993); 

Dong et al., 3. Immunol. 151(5):2717-2724 (1993); Gazit et al., J. Med. Chem. 32:2344- 
2352 (1989); Gazit et al., " J. Med. Chem. 36:3556-3564 (1993); Kaur et al., Anti -Cancer 
Drugs 5:213-222 (1994); Kaur et al., King et al., Biochem. J. 275:413-418 (1991); Kuo et 
al., Cancer Letters 74:197-202 (1993); Levitzki, A., TheFASEBJ. 6:3275-3282 (1992); 

10 Lyall et al., J. Biol. Chem. 264:14503-14509 (1989); Peterson et al., The Prostate 22:335- 

345 (1993); Pillemer et al., Int. J. Cancer 50:80-85 (1992); Posner et al., Molecular 
Pharmacology 45:673-683 (1993); Rendu et al., Biol. Pharmacology 44(5):881-888 
(1992); Sauro and Thomas, Life Sciences 53:371-376 (1993); Sauro and Thomas, L 
Pharm. and Experimental Therapeutics 267(3): 1 19-1 125 (1993); Wolbring et al., LBioL 

15 Chem. 269(36):22470-22472 (1994); and Yoneda et al., Cancer Research 51:4430-4435 

(1991); all of which are incorporated herein by reference in their entirety, including any 
drawings. 

Other compounds that could be used as modulators include oxindolinones such as 
those described in U.S. patent application Serial No. 08/702,232 filed August 23, 1996, 

20 incorporated herein by reference in its entirety, including any drawings. 
Methods of Treating a Disease (Enablement - i.e.. Dosing) 

Methods of determining the dosages of compounds to be administered to a patient 
and modes of administering compounds to an organism are disclosed in U.S. Application 
Serial No. 08/702,282, filed August 23, 1996 and International patent publication number 

25 WO 96/22976, published August 1 1 996, both of which are incorporated herein by 

reference in their entirety, including any drawings, figures or tables. Those skilled in the 
art will appreciate that such descriptions are applicable to the present invention and can be 
easily adapted to it. 

The proper dosage depends on various factors such as the type of disease being 
30 treated, the particular composition being used and the size and physiological condition of 
the patient. Therapeutically effective doses for the compounds described herein can be 
estimated initially from cell culture and animal models. For example, a dose can be 
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formulated in animal models to achieve a circulating concentration range that initially 
takes into account the IC 50 as determined in cell culture assays. The animal model data 
can be used to more accurately determine useful doses in humans. 

Plasma half-life and biodistribution of the drug and metabolites in the plasma, 

5 tumors and major organs can also be determined to facilitate the selection of drugs most 

appropriate to inhibit a disorder. Such measurements can be carried out. For example, 
HPLC analysis can be performed on the plasma of animals treated with the drug and the 
location of radiolabeled compounds can be deter-mined using detection methods such as 
X-ray, CAT scan and MR1 Compounds that show potent inhibitory activity in the 

1 0 screening assays, but have poor pharmacokinetic characteristics, can be optimized by 

altering the chemical structure and retesting. In this regard, compounds displaying good 
pharmacokinetic characteristics can be used as a model. 

Toxicity studies can also be carried out by measuring the blood cell composition. 
For example, toxicity studies can be carried out in a suitable animal model as follows: 

15 1 ) the compound is administered to mice (an untreated control mouse should also be 

used); 2) blood samples are periodically obtained via the tail vein from one mouse in each 
treatment group; and 3) the samples are analyzed for red and white blood cell counts, 
blood cell composition and the percent of lymphocytes versus polymorphonuclear cells. A 
comparison of results for each dosing regime with the controls indicates if toxicity is 

20 present. 

At the termination of each toxicity study, further studies can be carried out by 
sacrificing the animals (preferably, in accordance with the American Veterinary Medical 
Association guidelines Report of the American Veterinary Medical Assoc. Panel on 
F.nthanasia. Journal of American Veterinary Medical Assoc. , 202:229-249, 1993). 

25 Representative animals from each treatment group can then be examined by gross 

necropsy for immediate evidence of metastasis, unusual illness or toxicity. Gross 
abnormalities in tissue are noted and tissues are examined histologically. Compounds 
causing a reduction in body weight or blood components are less preferred, as are 
compounds having an adverse effect on major organs. In general, the greater the adverse 

30 effect the less preferred the compound. 
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For the treatment of cancers the expected daily dose of a hydrophobic 
pharmaceutical agent is between 1 to 500 mg/day, preferably 1 to 250 mg/day, and most 
preferably 1 to 50 mg/day. Drugs can be delivered less frequently provided plasma levels 
of the active moiety are sufficient to maintain therapeutic effectiveness. 
5 Plasma levels should reflect the potency of the drug. Generally, the more potent 

the compound the lower the plasma levels necessary to achieve efficacy. 

In a final aspect, the invention features a method for detection of a kinase 
polypeptide in a sample as a diagnostic tool for a disease or disorder, wherein the method 
.comprises: (a) comparing a nucleic acid target region encoding the kinase polypeptide in 

1 0 a sample, where the kinase polypeptide is selected from the group consisting of SEQ ID 
NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID 
NO:l 27, SEQ ID NO: 1 28, SEQ ID NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 131, SEQ ID 
NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID 
NO: 137, SEQ ID NO: 1 38, SEQ ID NO: 1 39, SEQ ID NO: 1 40, SEQ ID NO: 1 4 1 , SEQ ID 

15 NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ED NO: 146, SEQ ID 

NO:147, SEQ ED NO:148, SEQ ED NO:149, SEQ ED NO:150, SEQ ED NO:151, SEQ ED 
NO:152, SEQ ED NO:153, SEQ ED NO:154, SEQ ED NO:155, SEQ ED NO:156, SEQ ID 
NO: 157, SEQ ED NO: 158, SEQ ED NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID 
NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID 

20 NO: 1 67, SEQ ID NO: 1 68, SEQ ED NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID 
NO:l 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID 
NO: 1 77, SEQ ID NO: 1 78, SEQ ED NO: 1 79, SEQ ED NO: 1 80, SEQ ED NO: 1 8 1 , SEQ ID 
NO: 1 82, SEQ ID NO:l 83, SEQ ID NO: 1 84, SEQ ED NO: 1 85, SEQ ED NO: 1 86, SEQ ID 
NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ ID 

25 NO: 1 99, SEQ ED NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ ID 
NO: 197, SEQ ID NO: 198, SEQ ED NO: 199, SEQ ED NO:200, SEQ ID NO:201, SEQ ID 
NO:202, SEQ ED NO:203, SEQ ED NO:204, SEQ ED NO:205, SEQ ED NO:206, SEQ ID 
NO:207, SEQ ED NO:208, SEQ ED NO:209, SEQ ED NO:210, SEQ ED NO:21 1, SEQ ID 
NO:212, SEQ ED NO:213, SEQ ED NO:214, SEQ ED NO:215, SEQ ED NO:216, SEQ ED 

30 NO:2 1 7, SEQ ED NO:2 1 8, SEQ ID NO:2 1 9, SEQ ED NO:220, SEQ ED NO:22 1 , SEQ ID 
NO:222, SEQ ID N0.223, SEQ ED NO:224, SEQ ED NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:23 1 , SEQ ID 
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NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ED NO:235, SEQ ID NO:236, SEQ ID 
NO:237, SEQ ID NO:238, SEQ ED NO:239, SEQ ED NO:240, SEQ ID NO:241, and SEQ 
ID NO:242, or one or more fragments thereof, with a control nucleic acid target region 
encoding the kinase polypeptide, or one or more fragments thereof; and (b) detecting 
5 differences in sequence or amount between the target region and the control target region, 

as an indication of the disease or disorder. Preferably, the disease or disorder is selected 
from the group consisting of immune-related diseases and disorders, organ transplantation, 
myocardial infarction, cardiovascular disease, stroke, renal failure, oxidative stress-related 
neurodegenerative disorders, and cancer. Immune-related diseases and disorders include, 

1 0 but are not limited to, those discussed previously. 

The term "comparing" as used herein refers to identifying discrepancies between 
the nucleic acid target region isolated from a sample, and the control nucleic acid target 
region. The discrepancies can be in the nucleotide sequences, e.g. insertions, deletions, or 
point mutations, or in the amount of a given nucleotide sequence. Methods to determine 

1 5 these discrepancies in sequences are well-known to one of ordinary skill in the art. The 

"control" nucleic acid target region refers to the sequence or amount of the sequence 
found in normal cells, e.g. cells that are not diseased as discussed previously. 
The term also includes anti-sense molecules drawn thereto. 
The invention has been described broadly and generically herein. Each of the 

20 narrower species and subgeneric groupings falling within the generic disclosure also form 

part of the invention. This includes the generic description of the invention with a proviso 
or negative limitation removing any subject matter from the genus, regardless of whether 
or not the excised material is specifically recited herein. For example, in some instances 
the nucleotide sequence of particular kinase polypeptides may not be part of a preferred 

25 embodiment. 

The summary of the invention described above is not limiting and other features 
and advantages of the invention will be apparent from the following detailed description of 
the invention, and from the claims. 
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BRIEF DESCRIPTION OF THE FIGURES 
Figures 1 A to IBB shows the amino acid sequences of SEQ ID NO:122, SEQ ID 
NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID N0:127, SEQ ID 
NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID 
NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID 
NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID 
NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID 
NO: 148, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID 
NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID 
NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID 
NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID 
NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID 
NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID 
NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID 
NO: 183, SEQ ID NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID NO: 187, SEQ ID 
NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID 
NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID 
N0.198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID 
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID 
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID 
NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID 
NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID 
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID 
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID 
NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID 
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ED NO:242. 

Figures 2A to 2MMMM shows the nucleic acid sequences of SEQ ID NO: 1 , SEQ 
ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, 
SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID NO: 12, SEQ ID 
NO:13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, 
SEQ ID NO: 1 9, SEQ ID NO:20, SEQ ID NO:21 , SEQ ID NO:22, SEQ ID NO:23, SEQ 
ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID 
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NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, 
SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ 
ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID 
NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, 
5 SEQ ID NO:51 , SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ 
ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID 
N0:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, 
SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID N0:71 , SEQ 
ED NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID 

1 0 NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:80, SEQ ID N0:8 1 , SEQ ID NO:82, 
SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ 
ID N0.88, SEQ ED NO:89, SEQ ED NO:90, SEQ ID N0:91, SEQ ID NO:92, SEQ ID 
NO:93, SEQ ED NO:94, SEQ ED NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, 
SEQ ED NO:99, SEQ ED NO: 100, SEQ ED NO: 101, SEQ ED NO: 102, SEQ ID NO: 103, 

15 SEQ ED NO: 104, SEQ ED NO: 105, SEQ ED NO: 106, SEQ ID NO: 107, SEQ ED NO: 108, 
SEQ ED NO: 1 09, SEQ ED NO: 1 1 0, SEQ ED NO: 1 1 1 , SEQ ED NO: 1 1 2, SEQ ED NO: 1 1 3 , 
SEQ ED NO: 1 1 4, SEQ ED NO: 1 1 5, SEQ ED NO: 1 1 6, SEQ ED NO: 1 1 7, SEQ ED NO: 1 1 8, 
SEQ ED NO:l 19, SEQ ED NO: 120, and SEQ ED NO: 121. 

20 DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates in part to kinase polypeptides, nucleic acids encoding 
such polypeptides, cells containing such nucleic acids, antibodies to such polypeptides, 
assays utilizing such polypeptides, and methods relating to all of the foregoing. The 
present invention is based upon the isolation and characterization of new kinase 

25 polypeptides. The polypeptides and nucleic acids may be produced using well-known and 
standard synthesis techniques when given the sequences presented herein. 

I. The Nucleic Acids of the Invention 

Included within the scope of this invention are the functional equivalents of the 
30 herein-described isolated nucleic acid molecules. The degeneracy of the genetic code 
permits substitution of certain codons by other codons that specify the same amino acid 
and hence would give rise to the same protein. The nucleic acid sequence can vary 
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substantially since, with the exception of methionine and tryptophan, the known amino 
acids can be coded for by more than one codon. Thus, portions or all of the kinase genes 
of the invention could be synthesized to give a nucleic acid sequence significantly 
different from one selected from the group consisting of those set forth in SEQ ID NO:l , 
5 SEQ ID NO:2, SEQ ED NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ED NO:6, SEQ ID 
NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ED NO:10, SEQ ED NO:l 1 , SEQ ID NO:12, 
SEQ ID NO: 13, SEQ ID NO: 14, SEQ ED NO: 15, SEQ ED NO: 16, SEQ ED NO: 17, SEQ 
ID NO: 18, SEQ ID NO: 19, SEQ ID NO:20, SEQ ID NO:21, SEQ ED NO:22, SEQ ID 
NO:23, SEQ ED NO:24, SEQ ED NO:25, SEQ ED NO:26, SEQ ID NO:27, SEQ ED NO:28, 

10 SEQ ED NO:29, SEQ ID NO:30, SEQ ID NO:3 1 , SEQ ED NO:32, SEQ ED NO:33, SEQ 
ED NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ED 
NO:39, SEQ ED NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, 
SEQ ED NO:45, SEQ ED NO:46, SEQ ID NO:47, SEQ ED NO:48, SEQ ID NO:49, SEQ 
ID NO:50, SEQ ID NO:51 , SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID 

15 NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ED NO:59, SEQ ID NO:60, 

SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ 
ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID 
NO:71 , SEQ ID NO:72, SEQ ED NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, 
SEQ ED NO:77, SEQ ED NO:78, SEQ ED NO:79, SEQ ED NO:80, SEQ ID NO:81 , SEQ 

20 ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ID 

NO:87, SEQ ED NO:88, SEQ ED NO:89, SEQ ID NO:90, SEQ ED NO:91 , SEQ ID NO:92, 
SEQ ED NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ED NO:96, SEQ ID NO:97, SEQ 
ED NO:98, SEQ ED NO:99, SEQ ED NO: 100, SEQ ID NO: 101, SEQ ED NO: 102, SEQ ID 
NO: 103, SEQ ID NO: 104, SEQ ED NO: 105, SEQ ID NO: 106, SEQ ED NO: 107, SEQ ID 

25 NO: 108, SEQ ID NO: 109, SEQ ED NO:l 10, SEQ ED NO:l 1 1, SEQ ID NO: 1 12, SEQ ED 

NO: 1 1 3, SEQ ID NO: 1 1 4, SEQ ED NO: 1 1 5, SEQ ID NO: 1 1 6, SEQ ID NO: 1 1 7, SEQ ID 
NO:l 18, SEQ ID NO:l 19, SEQ ID NO: 120, and SEQ ID NO:121 . The encoded amino 
acid sequence thereof would, however, be preserved. 

In addition, the nucleic acid sequence may comprise a nucleotide sequence which 

30 results from the addition, deletion or substitution of at least one nucleotide to the 5 '-end 
and/or the 3'-end of the nucleic acid sequence shown in SEQ ID NO: 1 , SEQ TD NO:2, 
SEQ ED NO:3, SEQ ED NO:4, SEQ ID NO:5, SEQ ED NO:6, SEQ ID NO.7, SEQ ID 
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NO:8, SEQ ID NO:9, SEQ ED NO: 10, SEQ ID NO:l 1, SEQ ID NO: 12, SEQ ID NO: 13, 
SEQ ID NO: 1 4, SEQ ID NO: 1 5, SEQ ID NO: 1 6, SEQ ID NO: 1 7, SEQ ID NO: 1 8, SEQ 
ID NO: 19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID 
NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, 
5 SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ 
ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID 
NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, 
SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ 
ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID 

10 NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61 , 
SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ 
ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID 
NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, 
SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ 

15 ID NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID 

NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, 
SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ 
ID NO:99, SEQ ID NO: 100, SEQ ID NO: 101, SEQ ID NO: 102, SEQ ID NO: 103, SEQ ID 
NO: 1 04, SEQ ID NO: 1 05, SEQ ID NO: 1 06, SEQ ID NO: 1 07, SEQ ID NO: 1 08, SEQ ID 

20 NO: 109, SEQ ID NO:l 10, SEQ ID NO: 1 1 1, SEQ ID NO:l 12, SEQ ID NO: 1 13, SEQ ID 
NO:l 14, SEQ ID NO: 11 5, SEQ ID NO:l 16, SEQ ID NO: 11 7, SEQ ID NO:l 18, SEQ ID 
NO:l 19, SEQ ID NO: 120, and SEQ ID NO: 121, or a derivative thereof. Any nucleotide 
or polynucleotide may be used in this regard, provided that its addition, deletion or 
substitution does not alter the amino acid sequence of SEQ ID NO: 122, SEQ ID NO: 123, 

25 SEQ ID NO: 124, SEQ ID NO: 125, SEQ ED NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, 
SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, 
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, 
SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ED NO: 143, 
SEQ ED NO: 144, SEQ ED NO: 145, SEQ ED NO: 146, SEQ ED NO: 147, SEQ ED NO: 148, 

30 SEQ ED NO:149, SEQ ED NO:150, SEQ ED NO:151, SEQ ED NO:152, SEQ ID NO:153, 
SEQ ED NO: 154, SEQ ED NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, 
SEQ ID NO: 159, SEQ ED NO: 160, SEQ ID NO: 161, SEQ ED NO: 162, SEQ ID NO: 163, 
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SEQ ID NO: 1 64, SEQ ID NO: 1 65. SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, 
SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID NO:171, SEQ ID NO: 172, SEQ ID NO: 173, 
SEQ ID NO: 174, SEQ ED NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 178, 
SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, 

5 SEQ ID NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID NO: 187, SEQ ED NO: 188, 
SEQ ED NO: 189, SEQ ED NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, 
SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, 
SEQ ID NO: 199, SEQ ED NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, 
SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, 

10 SEQ ID NO:209, SEQ ED NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:21 3, 
SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:21 8, 
SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, 
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, 
SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231 , SEQ ID NO:232, SEQ ID NO:233, 

15 SEQ ED NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, 

SEQ ID NO:239, SEQ ED NO:240, SEQ ID NO:241, and SEQ ID NO:242, that is encoded 
by the nucleotide sequence. For example, the present invention is intended to include any 
nucleic acid sequence resulting from the addition of ATG as an initiation codon at the 5'- 
end of the inventive nucleic acid sequence or its derivative, or from the addition of TTA, 

20 TAG or TGA as a termination codon at the 3'-end of the inventive nucleotide sequence or 
its derivative. Moreover, the nucleic acid molecule of the present invention may, as 
necessary, have restriction endonuclease recognition sites added to its 5 '-end and/or 3'- 
end. 

Such functional alterations of a given nucleic acid sequence afford an opportunity 
25 to promote secretion and/or processing of heterologous proteins encoded by foreign 

nucleic acid sequences fused thereto, for example. All variations of the nucleotide 
sequence of the kinase genes of the invention and fragments thereof permitted by the 
genetic code are, therefore, included in this invention. 

Further, it is possible to delete codons or to substitute one or more codons with 
30 codons other than degenerate codons to produce a structurally modified polypeptide, but 
one which has substantially the same utility or activity as the polypeptide produced by the 
unmodified nucleic acid molecule. As recognized in the art, the two polypeptides are 
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functionally equivalent, as are the two nucleic acid molecules that give rise to their 
production, even though the differences between the nucleic acid molecules are not related 
to the degeneracy of the genetic code. This is discussed further in the "Functional 
Derivatives" section, herein. 

5 Finally, many of the nucleic acid molecules of the invention are provided as a 

partial sequence only (Fig. 2A through 2QQ). However, it is standard for one of ordinary 
skill in the art to obtain a full-length sequence when provided with a partial sequence. 
Similarly, when provided with a partial or full-length sequence it is standard for one of 
ordinary skill in the art to obtain nucleic acid sequence coding for homologous proteins. 

10 Therefore, these nucleic acid molecules are also part of the invention. 

The characteristics of the protein kinase nucleic acid sequences of the invention are 
provided in Table 1 . The protein kinases fall into 10 known groups: AGC, CAMK, CK1, 
CMGC, dsPK, EIFK, LIMK, MLK, STE and TK. In addition, there are a significant 
number of protein kinases that do not belong to any of the known groups, and therefore 

1 5 presumably define new protein kinase groups. 

Additional characteristics may be found, inter alia, in the tables, namely Table 1 , 
Table 2, Table 3 and Table 4, shown below. 

II. Nucleic Acid Probes, Methods, and Kits for Detection of Pr otein Kinases. 

20 A nucleic acid probe of the present invention may be used to probe an appropriate 

chromosomal or cDNA library by usual hybridization methods to obtain other nucleic acid 
molecules of the present invention. A chromosomal DNA or cDNA library may be 
prepared from appropriate cells according to recognized methods in the art (cf. "Molecular 
Cloning: A Laboratory Manual", second edition, Cold Spring Harbor Laboratory, 

25 Sambrook, Fritsch, & Maniatis, eds., 1989). 

In the alternative, chemical synthesis can be carried out in order to obtain nucleic 
acid probes having nucleotide sequences that correspond to N-terminal, kinase or C- 
terminal portions, for example, of the amino acid sequence of the polypeptide of interest. 
The synthesized nucleic acid probes may be used as primers in a polymerase chain 

30 reaction (PCR) carried out in accordance with recognized PCR techniques, essentially 
according to PCR Protocols, "A Guide to Methods and Applications", Academic Press, 
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Michael, et a/., eds., 1990, utilizing the appropriate chromosomal or cDNA library to 
obtain the fragment of the present invention. 

One skilled in the art can readily design such probes based on the sequence 
disclosed herein using methods of computer alignment and sequence analysis known in 
5 the art ("Molecular Cloning: A Laboratory Manual", 1989, supra). The hybridization 

probes of the present invention can be labeled by standard labeling techniques such as 
with a radiolabel, enzyme label, fluorescent label, biotin-avidin label, chemiluminescence, 
and the like. After hybridization, the probes may be visualized using known methods. 
The nucleic acid probes of the present invention include RNA, as well as DNA 

1 0 probes, such probes being generated using techniques known in the art. The nucleic acid 

probe may be immobilized on a solid support. Examples of such solid supports include, 
but . are not limited to, plastics such as polycarbonate, complex carbohydrates such as 
agarose and sepharose, and acrylic resins, such as polyacrylamide and latex beads. 
Techniques for coupling nucleic acid probes to such solid supports are well known in the 

15 art. 

The test samples suitable for nucleic acid probing methods of the present invention 
include, for example, cells or nucleic acid extracts of cells, or biological fluids. The 
samples used in the above-described methods will vary based on the assay format, the 
detection method and the nature of the tissues, cells or extracts to be assayed. Methods for 

20 preparing nucleic acid extracts of cells are well known in the art and can be readily 

adapted in order to obtain a sample that is compatible with the method utilized. 

One method of detecting the presence of nucleic acids of the invention in a sample 
comprises (a) contacting said sample with the above-described nucleic acid probe under 
conditions such that hybridization occurs, and (b) detecting the presence of said probe 

25 bound to said nucleic acid molecule. One skilled in the art would select the nucleic acid 

probe according to techniques known in the art as described above. Samples to be tested 
include but should not be limited to RNA samples of human tissue. 

A kit for detecting the presence of nucleic acids of the invention in a sample 
comprises at least one container means having disposed therein the above-described 

30 nucleic acid probe. The kit may further comprise other containers comprising one or more 

of the following: wash reagents and reagents capable of detecting the presence of bound 
nucleic acid probe. Examples of detection reagents include, but are not limited to 
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radiolabeled probes, enzymatic labeled probes (horseradish peroxidase, alkaline 
phosphatase), and affinity labeled probes (biotin, avidin, or steptavidin). 

In detail, a compartmentalized kit includes any kit in which reagents are contained 
in separate containers. Such containers include small glass containers, plastic containers 

5 or strips of plastic or paper. Such containers allow the efficient transfer of reagents from 

one compartment to another compartment such that the samples and reagents are not 
cross-contaminated and the agents or solutions of each container can be added in a 
quantitative fashion from one compartment to another. Such containers will include a 
container which will accept the test sample, a container which contains the probe or 

10 primers used in the assay, containers which contain wash reagents (such as phosphate 

buffered saline, Tris-buffers, and the like), and containers which contain the reagents used 
to detect the hybridized probe, bound antibody, amplified product, or the like. One skilled 
in the art will readily recognize that the nucleic acid probes described in the present 
invention can readily be incorporated into one of the established kit formats that are well 

15 known in the art. 

HI. DNA Constructs Comprising a Protein Kinase Nucleic Acid Molecule and Cells 
Containing These Constructs. 

The present invention also relates to a recombinant DNA molecule comprising, 5' 
20 to 3', a promoter effective to initiate transcription in a host cell and the above-described 

nucleic acid molecules. In addition, the present invention relates to a recombinant DNA 
molecule comprising a vector and an above-described nucleic acid molecule. The present 
invention also relates to a nucleic acid molecule comprising a transcriptional region 
functional in a cell, a sequence complementary to an RNA sequence encoding an amino 
25 acid sequence corresponding to the above-described polypeptide, and a transcriptional 

termination region functional in said cell. The above-described molecules may be isolated 
and/or purified DNA molecules. 

The present invention also relates to a cell or organism that contains an above- 
described nucleic acid molecule and thereby is capable of expressing a polypeptide. The 
30 polypeptide may be purified from cells that have been altered to express the polypeptide. 
A cell is said to be "altered to express a desired polypeptide" when the cell, through 
genetic manipulation, is made to produce a protein which it normally does not produce or 
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which the cell normally produces at lower levels. One skilled in the art can readily adapt 
procedures for introducing and expressing either genomic, cDNA, or synthetic sequences 
into either eukaryotic or prokaryotic cells. 

A nucleic acid molecule, such as DNA, is said to be "capable of expressing" a 
5 polypeptide if it contains nucleotide sequences which contain transcriptional and 

translational regulatory information and such sequences are "operably linked" to 
nucleotide sequences which encode the polypeptide. An operable linkage is a linkage in 
which the regulatory DNA sequences and the DNA sequence sought to be expressed are 
connected in such a way as to permit gene sequence expression. The precise nature of the 

1 0 regulatory regions needed for gene sequence expression may vary from organism to 

organism, but shall in general include a promoter region which, in prokaryotes, contains 
both the promoter (which directs the initiation of RNA transcription) as well as the DNA 
sequences which, when transcribed into RNA, will signal synthesis initiation. Such 
regions will normally include those 5 '-non-coding sequences involved with initiation of 

15 transcription and translation, such as the TATA box, capping sequence, CAAT sequence, 

and the like. 

If desired, the non-coding region 3' to the sequence encoding a kinase of the 
invention may be obtained by the above-described methods. This region may be retained 
for its transcriptional termination regulatory sequences, such as termination and 

20 polyadenylation. Thus, by retaining the 3'-region naturally contiguous to the DNA 

sequence encoding a kinase of the invention, the transcriptional termination signals may 
be provided. Where the transcriptional termination signals are not satisfactorily functional 
in the expression host cell, then a 3' region functional in the host cell may be substituted. 
Two DNA sequences (such as a promoter region sequence and a sequence 

25 encoding a kinase of the invention) are said to be operably linked if the nature of the 

linkage between the two DNA sequences does not (1) result in the introduction of a frame- 
shift mutation, (2) interfere with the ability of the promoter region sequence to direct the 
transcription of a gene sequence encoding a kinase of the invention, or (3) interfere with 
the ability of the gene sequence of a kinase of the invention to be transcribed by the 

30 promoter region sequence. Thus, a promoter region would be operably linked to a DNA 
sequence if the promoter were capable of effecting transcription of that DNA sequence. 
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Thus, to express a gene encoding a kinase of the invention, transcriptional and 
translational signals recognized by an appropriate host are necessary. 

The present invention encompasses the expression of a gene encoding a kinase of 
the invention (or a functional derivative thereof) in either prokaryotic or eukaryotic cells. 
5 Prokaryotic hosts are, generally, very efficient and convenient for the production of 

recombinant proteins and are, therefore, one type of preferred expression system for 
kinases of the invention. Prokaryotes most frequently are represented by various strains of 
E. coli. However, other microbial strains may also be used, including other bacterial 
strains. 

10 In prokaryotic systems, plasmid vectors that contain replication sites and control 

sequences derived from a species compatible with the host may be used. Examples of 
suitable plasmid vectors may include pBR322, pUCl 18, pUCl 19 and the like; suitable 
phage or bacteriophage vectors may include ygtlO, ygtl 1 and the like; and suitable virus 
vectors may include pMAM-neo, pKRC and the like. Preferably, the selected vector of the 

15 present invention has the capacity to replicate in the selected host cell. 

Recognized prokaryotic hosts include bacteria such as E. coli, Bacillus, 
Streptomyces, Pseudomonas, Salmonella, Serratia, and the like. However, under such 
conditions, the polypeptide will not be glycosylated. The prokaryotic host must be 
compatible with the replicon and control sequences in the expression plasmid. 

20 To express a kinase of the invention (or a functional derivative thereof) in a 

prokaryotic cell, it is necessary to operably link the sequence encoding the kinase of the 
invention to a functional prokaryotic promoter. Such promoters may be either constitutive 
or, more preferably, regulatable (i.e., inducible or derepressible). Examples of constitutive 
promoters include the int promoter of bacteriophage X, the bla promoter of the 0- 

25 lactamase gene sequence of pBR322, and the cat promoter of the chloramphenicol acetyl 

transferase gene sequence of pPR325, and the like. Examples of inducible prokaryotic 
promoters include the major right and left promoters of bacteriophage X (P L and P R ), the 
trp, recA, AacZ, XacI, and gal promoters of E. coli, the a-amylase (Ulmanen et al, J. 
Bacteriol. 162:176-182, 1985) and the c-28-specific promoters of B. subtilis (Gilman et 

30 al, Gene Sequence 32:1 1-20, 1984), the promoters of the bacteriophages of Bacillus 

(Gryczan, In: The Molecular Biology of the Bacilli, Academic Press, Inc., NY, 1 982), and 
Streptomyces promoters (Ward et al, Mol. Gen. Genet. 203:468-478, 1986). Prokaryotic 
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promoters are reviewed by Glick (Ind. Microbiot. 1 :277-282, 1987), Cenatiempo 
(Biochimie 68:505-516, 1986), and Gottesman (Ann. Rev. Genet. 18:415-442, 1984). 

Proper expression in a prokaryotic cell also requires the presence of a ribosome- 
binding site upstream of the gene sequence-encoding sequence. Such ribosome-binding 
5 sites are disclosed, for example, by Gold et ai (Ann. Rev. Microbiol. 35:365-404, 1981). 

The selection of control sequences, expression vectors, transformation methods, and the 
like, are dependent on the type of host cell used to express the gene. As used herein, 
"cell", "cell line", and "cell culture" may be used interchangeably and all such 
designations include progeny. Thus, the words "transformants" or "transformed cells" 

10 include the primary subject cell and cultures derived therefrom, without regard to the 

number of transfers. It is also understood that all progeny may not be precisely identical 
in DNA content, due to deliberate or inadvertent mutations. However, as defined, mutant 
progeny have the same functionality as that of the originally transformed cell. 

Host cells which may be used in the expression systems of the present invention 

15 are not strictly limited, provided that they are suitable for use in the expression of the 

kinase polypeptide of interest. Suitable hosts may often include eukaryotic cells. 
Preferred eukaryotic hosts include, for example, yeast, fungi, insect cells, mammalian cells 
either in vivo, or in tissue culture. Mammalian cells which may be useful as hosts include 
HeLa cells, cells of fibroblast origin such as VERO or CHO-K1, or cells of lymphoid 

20 origin and their derivatives. Preferred mammalian host cells include SP2/0 and J558L, as 

well as neuroblastoma cell lines such as DVIR 332, which may provide better capacities for 
correct post-translational processing. 

In addition, plant cells are also available as hosts, and control sequences 
compatible with plant cells are available, such as the cauliflower mosaic virus 35S and 

25 19S, and nopaline synthase promoter and polyadenylation signal sequences. Another 

preferred host is an insect cell, for example the Drosophila larvae. Using insect cells as 
hosts, the Drosophila alcohol dehydrogenase promoter can be used (Rubin, Science 
240:1453-1459, 1988). Alternatively, baculovirus vectors can be engineered to express 
large amounts of kinases of the invention in insect cells (Jasny, Science 238:1653, 1987; 

30 Miller et ai, In: Genetic Engineering, Vol. 8, Plenum, Setlow et a/., eds., pp. 277-297, 

1986). 
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Any of a series of yeast expression systems can be utilized which incorporate 
promoter and termination elements from the actively expressed sequences coding for 
glycolytic enzymes that are produced in large quantities when yeast are grown in mediums 
rich in glucose. Known glycolytic gene sequences can also provide very efficient 
transcriptional control signals. Yeast provides substantial advantages in that it can also 
carry out post-translational modifications. A number of recombinant DNA strategies exist 
utilizing strong promoter sequences and high copy number plasmids which can be utilized 
for production of the desired proteins in yeast. Yeast recognizes leader sequences on 
cloned mammalian genes and secretes peptides bearing leader sequences (i.e., pre- 
peptides). Several possible vector systems are available for the expression of kinases of 
the invention in a mammalian host. 

A wide variety of transcriptional and translational regulatory sequences may be 
employed, depending upon the nature of the host. The transcriptional and translational 
regulatory signals may be derived from viral sources, such as adenovirus, bovine 
papilloma virus, cytomegalovirus, simian virus, or the like, where the regulatory signals 
are associated with a particular gene sequence which has a high level of expression. 
Alternatively, promoters from mammalian expression products, such as actin, collagen, 
myosin, and the like, may be employed. Transcriptional initiation regulatory signals may 
be selected which allow for repression or activation, so that expression of the gene 
sequences can be modulated. Of interest are regulatory signals which are temperature- 
sensitive so that by varying the temperature, expression can be repressed or initiated, or 
are subject to chemical (such as metabolite) regulation. 

Expression of kinases of the invention in eukaryotic hosts requires the use of 
eukaryotic regulatory regions. Such regions will, in general, include a promoter region 
sufficient to direct the initiation of RNA synthesis. Preferred eukaryotic promoters 
include, for example, the promoter of the mouse metallothionein 1 gene sequence (Hamer 
et ai, J. Mol. Appl. Gen. 1 :273-288, 1982); the TK promoter of Herpes virus (McKnight, 
Cell 31 :355-365, 1982); the SV40 early promoter (Benoist et ai, Nature (London) 
290:304-31, 1981); and the yeast gal4 gene sequence promoter (Johnston et ai, Proc. Natl. 
Acad. Sci. (USA) 79:6971-6975, 1982; Silver et ai, Proc. Natl. Acad. Sci. (USA) 
81:5951-5955, 1984). 
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Translation of eukaryotic mRNA is initiated at the codon that encodes the first 
methionine. For this reason, it is preferable to ensure that the linkage between a 
eukaryotic promoter and a DNA sequence which encodes a kinase of the invention (or a 
functional derivative thereof) does not contain any intervening codons which are capable 

5 of encoding a methionine (i.e., AUG). The presence of such codons results either in the 

formation of a fusion protein (if the AUG codon is in the same reading frame as the kinase 
of the invention coding sequence) or a frame-shift mutation (if the AUG codon is not in 
the same reading frame as the kinase of the invention coding sequence). 

A nucleic acid molecule encoding a kinase of the invention and an operably linked 

] 0 promoter may be introduced into a recipient prokaryotic or eukaryotic cell either as a 

nonreplicating DNA or RNA molecule, which may either be a linear molecule or, more 
preferably, a closed covalent circular molecule. Since such molecules are incapable of 
autonomous replication, the expression of the gene may occur through the transient 
expression of the introduced sequence. Alternatively, permanent expression may occur 

1 5 through the integration of the introduced DNA sequence into the host chromosome. 

A vector may be employed which is capable of integrating the desired gene 
sequences into the host cell chromosome. Cells which have stably integrated the 
introduced DNA into their chromosomes can be selected by also introducing one or more 
markers which allow for selection of host cells which contain the expression vector. The 

20 marker may provide for prototrophy to an auxotrophic host, biocide resistance, e.g., 

antibiotics, or heavy metals, such as copper, or the like. The selectable marker gene 
sequence can either be directly linked to the DNA gene sequences to be expressed, or 
introduced into the same cell by co-transfection. Additional elements may also be needed 
for optimal synthesis of mRNA. These elements may include splice signals, as well as 

25 transcription promoters, enhancers, and termination signals. cDNA expression vectors 

incorporating such elements include those described by Okayama (Mol. Cell. Biol. 3:280-, 
1983). 

The introduced nucleic acid molecule can be incorporated into a plasmid or viral 
vector capable of autonomous replication in the recipient host. Any of a wide variety of 
30 vectors may be employed for this purpose. Factors of importance in selecting a particular 
plasmid or viral vector include: the ease with which recipient cells that contain the vector 
may be recognized and selected from those recipient cells which do not contain the vector; 
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the number of copies of the vector which are desired in a particular host; and whether it is 
desirable to be able to "shuttle" the vector between host cells of different species. 

Preferred prokaryotic vectors include plasmids such as those capable of replication 
in£. coli (such as, for example, pBR322, ColEl, pSClOl, pACYC 184, ttVX; "Molecular 
5 Cloning: A Laboratory Manual", 1989, supra). Bacillus plasmids include pCl 94, pC221 , 
pT127, and the like (Gryczan, In: The Molecular Biology of the Bacilli, Academic Press, 
NY, pp. 307-329, 1982). Suitable Streptomyces plasmids include pi J101 (Kendall et al z 
J. Bacteriol. 169:4177-4183, 1987), and streptomyces bacteriophages such as 4>C3 1 
_(Chater et aL, In: Sixth Intemational Symposium on Actinomycetales Biology, Akademiai 
10 Kaido, Budapest, Hungary, pp. 45-54, 1986). Pseudomonas plasmids are reviewed by 
John et aL (Rev. Infect. Dis. 8:693-704, 1986), and Izaki (Jpn. J. Bacteriol. 33:729-742, 
1978). 

Preferred eukaryotic plasmids include, for example, BPV, vaccinia, SV40, 2- 
micron circle, and the like, or their derivatives. Such plasmids are well known in the art 

15 (Botstein et aL, Miami Wntr. Symp. 19:265-274, 1982; Broach, In: "The Molecular 

Biology of the Yeast Saccharomyces: Life Cycle and Inheritance", Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY, p. 445-470, 1981; Broach, Cell 28:203-204, 1982; 
Bollon et aL, J. Clin. Hematol. Oncol. 10:39-48, 1980; Maniatis, In: Cell Biology: A 
Comprehensive Treatise, Vol. 3, Gene Sequence Expression, Academic Press, NY, pp. 

20 563-608, 1980). 

Once the vector or nucleic acid molecule containing the construct(s) has been 
prepared for expression, the DNA construct(s) may be introduced into an appropriate host 
cell by any of a variety of suitable means, i.e., transformation, transfection, conjugation, 
protoplast fusion, electroporation, particle gun technology, calcium phosphate- 

25 precipitation, direct microinjection, and the like. After the introduction of the vector, 

recipient cells are grown in a selective medium, which selects for the growth of vector- 
containing cells. Expression of the cloned gene(s) results in the production of a kinase of 
the invention, or fragments thereof. This can take place in the transformed cells as such, 
or following the induction of these cells to differentiate (for example, by administration of 

30 bromodeoxyuracil to neuroblastoma cells or the like). A variety of incubation conditions 
can be used to form the peptide of the present invention. The most preferred conditions 
are those which mimic physiological conditions. 
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IV. The Proteins of the Invention 

A variety of methodologies known in the art can be utilized to obtain the 
polypeptides of the present invention. The polypeptides may be purified from tissues or 
cells that naturally produce the polypeptides. Alternatively, the above-described isolated 
5 nucleic acid fragments could be used to express the kinases of the invention in any 

organism. The samples of the present invention include cells, protein extracts or 
membrane extracts of cells, or biological fluids. The samples will vary based on the assay 
format, the detection method, and the nature of the tissues, cells or extracts used as the 
sample. 

10 Any eukaryotic organism can be used as a source for the polypeptides of the 

invention, as long as the source organism naturally contains such polypeptides. As used 
herein, "source organism" refers to the original organism from which the amino acid 
sequence of the subunit is derived, regardless of the organism the subunit is expressed in 
and ultimately isolated from. 

15 One skilled in the art can readily follow known methods for isolating proteins in 

order to obtain the polypeptides free of natural contaminants. These include, but are not 
limited to: size-exclusion chromatography, HPLC, ion-exchange chromatography, and 
immuno-affmity chromatography. 

Further, the polypeptides of the invention include the full-length polypeptides that 

20 can be identified from the full-length or partial sequences encoded by SEQ ID NO:l 22, 

SEQ ED NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, 
SEQ ED NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO: 131, SEQ ID NO:132, 
SEQ ID NO: 1 33, SEQ ID NO:l 34, SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ ID NO: 1 37, 
SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, 

25 SEQ ID NO:143, SEQ ID N0.144, SEQ ID NO:145, SEQ ID NO: 146, SEQ ID NO:147, 

SEQ ED NO:148, SEQ ID NO:149, SEQ ED NO:150, SEQ ID NO.T51, SEQ ID NO:152, 
SEQ ED NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID NO: 1 57, 
SEQ ID NO:l 58, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ED NO: 162, 
SEQ ID NO: 163, SEQ ID NO: 164, SEQ ED NO: 165. SEQ ID NO: 166, SEQ ED NO: 167, 

30 SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, 
SEQ ED NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, 
SEQ ED NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, 
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SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, 
SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ ID NO: 1 99, 
SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, 
SEQ ID NO: 198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, 
5 SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, 
SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, 
SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, 
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, 
SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, 

10 SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:23 1 , SEQ ID NO:232, 
SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, 
SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ ID 
NO:242 (Figure 1). In addition, the polypeptides of the invention include the domains of 
these polypeptides, including, but not limited to, the N-terminal, kinase/catalylic, and C- 

15 terminal domains. 

The characteristics of the protein kinase nucleic acid sequences of the invention are 
provided in Table 1 . The protein kinases fall into 1 0 known groups: AGC, CAMK, CKI, 
CMGC, dsPK, EIFK, LDvlK, MLK, STE and TK. In addition, there are a significant 
number of protein kinases that do not belong to any of the known groups, and therefore 

20 presumably define new protein kinase groups. 

Additional characteristics are shown in, inter alia, the tables, namely Table 1 , 
Table 2, Table 3 and Table 4, provided below. 



V. Antibodies. Hvbridomas. Methods of Use and Kits for Detection of Protein 

25 Kinases 

The present invention relates to an antibody having binding affinity to a kinase of 
the invention. The polypeptide may have an amino acid sequence selected from the group 
consisting of those set forth in SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ 
ID NO: 1 25, SEQ ID NO: 1 26, SEQ ID NO: 1 27, SEQ ID NO: 1 28, SEQ ID NO: 1 29, SEQ 

30 ID NO: 1 30, SEQ ID NO:l 3 1 , SEQ ID NO: 1 32, SEQ ID NO: 1 33, SEQ ID NO: 1 34, SEQ 
ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ 
ID NO: 1 40, SEQ ID NO: 1 4 1 , SEQ ID NO: 1 42, SEQ ID NO: 1 43, SEQ ID NO: 1 44, SEQ 
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20 ID NO:240, SEQ ID NO:241 , and SEQ ID NO:242, or a functional derivative thereof, or 

at least 9 contiguous amino acids thereof (preferably, at least 20, 30, 35, or 40 or more 
contiguous amino acids thereof). Alternatively, the antibody may bind to a part of the 
polypeptide not provided in the sequences above, but that is present in the full-length 
sequence of the polypeptide and that is easily obtained using methods standard in the art. 

25 Further, the antibody may bind specifically to particular domains of one or more of the 
kinases of the invention, including, but not. limited to, the N-terminal, kinase/catalytic, or 
C-terminal domains. 

The present invention also relates to an antibody having specific binding affinity to 
a kinase or kinase domain of the invention. Such an antibody may be isolated by 
30 comparing its binding affinity to a kinase of the invention with its binding affinity to other 
polypeptides. Those that bind selectively to a kinase of the invention would be chosen for 
use in methods requiring a distinction between a kinase of the invention and other 
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polypeptides. Such methods could include, but should not be limited to, the analysis of 
altered kinase expression in tissue containing other polypeptides. 

The kinases of the present invention can be used in a variety of procedures and 
methods, such as for the generation of antibodies, for use in identifying pharmaceutical 
5 compositions, and for studying DNA/protein interaction. 

The kinases of the present invention can be used to produce antibodies or 
hybridomas. One skilled in the art will recognize that if an antibody is desired, such a 
peptide could be generated as described herein and used as an immunogen. The antibodies 
of the present invention include monoclonal and polyclonal antibodies, as well fragments 
10 of these antibodies, and humanized forms. Humanized forms of the antibodies of the 

present invention may be generated using one of the procedures known in the art such as 
chimerization or CDR grafting. 

The present invention also relates to a hybridoma that produces the above- 
described monoclonal antibody, or binding fragment thereof. A hybridoma is an 
15 immortalized cell line that is capable of secreting a specific monoclonal antibody. 

In general, techniques for preparing monoclonal antibodies and hybridomas are 
well known in the art (Campbell, "Monoclonal Antibody Technology: Laboratory 
Techniques in Biochemistry and Molecular Biology," Elsevier Science Publishers, 
Amsterdam, The Netherlands, 1984; St. Groth et al y J. Immunol. Methods 35:1-21, 1980). 
20 Any animal (mouse, rabbit, and the like) which is known to produce antibodies can be 

immunized with the selected polypeptide. Methods for immunization are well known in 
the art. Such methods include subcutaneous or intraperitoneal injection of the 
polypeptide. One skilled in the art will recognize that the amount of polypeptide used for 
immunization will vary based on the animal that is immunized, the antigenicity of the 
25 polypeptide and the site of injection. 

The polypeptide may be modified or administered in an adjuvant in order to 
increase the peptide antigenicity. Methods of increasing the antigenicity of a polypeptide 
are well known in the art. Such procedures include coupling the antigen with a 
heterologous protein (such as globulin or P-galactosidase) or through the inclusion of an 
30 adjuvant during immunization. 



WO 00/73469 



PCT/US00/14842 



91 

For monoclonal antibodies, spleen cells from the immunized animals are removed, 
fused with myeloma cells, such as SP2/0-Agl4 myeloma cells, and allowed to become 
monoclonal antibody producing hybridoma cells. Any one of a number of methods well 
known in the art can be used to identify the hybridoma cell that produces an antibody with 
5 the desired characteristics. These include screening the hybridomas with an EL1SA assay, 
western blot analysis, or radioimmunoassay (Lutz et ai, Exp. Cell Res. 175:109-124, 
1988). Hybridomas secreting the desired antibodies are cloned and the class and subclass 
are determined using procedures known in the art (Campbell, "Monoclonal Antibody 
Technology: Laboratory Techniques in Biochemistry and Molecular Biology", supra, 
10 1984). 

For polyclonal antibodies, antibody-containing antisera is isolated from the 
immunized animal and is screened for the presence of antibodies with the desired 
specificity using one of the above-described procedures. The above-described antibodies 
may be detectably labeled. Antibodies can be detectably labeled through the use of 

15 radioisotopes, affinity labels (such as biotin, avidin, and the like), enzymatic labels (such 

as horse radish peroxidase, alkaline phosphatase, and the like) fluorescent labels (such as 
FITC or rhodamine, and the like), paramagnetic atoms, and the like. Procedures for 
accomplishing such labeling are well-known in the art, for example, see Stemberger et ai, 
J. Histochem. Cytochem. 18:315, 1970; Bayers ai, Meth. Enzym. 62:308-, 1979; Engval 

20 et ai, Immunol. 109:129-, 1972; Goding, J. Immunol. Meth. 13:215-, 1976. The labeled 

antibodies of the present invention can be used for in vitro, in vivo, and in situ assays to 
identify cells or tissues that express a specific peptide. 

The above-described antibodies may also be immobilized on a solid support. 
Examples of such solid supports include plastics such as polycarbonate, complex 

25 carbohydrates such as agarose and sepharose, acrylic resins and such as polyacrylamide 

and latex beads. Techniques for coupling antibodies to such solid supports are well known 
in the art (Weir et ai, "Handbook of Experimental Immunology" 4th Ed., Blackwell 
Scientific Publications, Oxford, England, Chapter 10, 1986; Jacoby et ai, Meth. Enzym. 
34, Academic Press, N.Y., 1974). The immobilized antibodies of the present invention 

30 can be used for in vitro, in vivo, and in situ assays as well as in immunochromotography. 
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Furthermore, one skilled in the art can readily adapt currently available procedures, 
as well as the techniques, methods and kits disclosed herein with regard to antibodies, to 
generate peptides capable of binding to a specific peptide sequence in order to generate 
rationally designed antipeptide peptides (Hurby et al. 9 "Application of Synthetic Peptides: 
5 Antisense Peptides", In Synthetic Peptides, A User's Guide, W.H. Freeman, NY, pp. 289- 
307, 1992; Kaspczak et aL, Biochemistry 28:9230-9238, 1989). 

Anti-peptide peptides can be generated by replacing the basic amino acid residues 
found in the peptide sequences of the kinases of the invention with acidic residues, while 
maintaining hydrophobic and uncharged polar groups. For example, lysine, arginine, 
10 and/or histidine residues are replaced with aspartic acid or glutamic acid and glutamic acid 

residues are replaced by lysine, arginine or histidine. 

The present invention also encompasses a method of detecting a kinase 
polypeptide in a sample, comprising: (a) contacting the sample with an above-described 
antibody, under conditions such that immunocomplexes form, and (b) detecting the 
1 5 presence of said antibody bound to the polypeptide. In detail, the methods comprise 

incubating a test sample with one or more of the antibodies of the present invention and 
assaying whether the antibody binds to the test sample. Altered levels of a kinase of the 
invention in a sample as compared to normal levels may indicate disease. 

Conditions for incubating an antibody with a test sample vary. Incubation 
20 conditions depend on the format employed in the assay, the detection methods employed, 

and the type and nature of the antibody used in the assay. One skilled in the art will 
recognize that any one of the commonly available immunological assay formats (such as 
radioimmunoassays, enzyme-linked immunosorbent assays, diffusion based Ouchterlony, 
or rocket immunofluorescent assays) can readily be adapted to employ the antibodies of 
25 the present invention. Examples of such assays can be found in Chard ("An Introduction 

to Radioimmunoassay and Related Techniques" Elsevier Science Publishers, Amsterdam, 
The Netherlands, 1986), Bullock et aL ('Techniques in Immunocytochemistry " Academic 
Press, Orlando, FL Vol. 1, 1982; Vol. 2, 1983; Vol. 3, 1985), Tijssen ("Practice and 
Theory of Enzyme Immunoassays: Laboratory Techniques in Biochemistry and Molecular 
30 Biology," Elsevier Science Publishers, Amsterdam, The Netherlands, 1985). 
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The immunological assay test samples of the present invention include cells, 
protein or membrane extracts of cells, or biological fluids such as blood, serum, plasma, or 
urine. The test samples used in the above-described method will vary based on the assay 
format, nature of the detection method and the tissues, cells or extracts used as the sample 
5 to be assayed. Methods for preparing protein extracts or membrane extracts of cells are 

well known in the art and can be readily be adapted in order to obtain a sample which is 
testable with the system utilized. 

A kit contains all the necessary reagents to carry out the previously described 
methods of detection. The kit may comprise: (i) a first container means containing an 
10 above-described antibody, and (ii) second container means containing a conjugate 

comprising a binding partner of the antibody and a label. In another preferred 
embodiment, the kit further comprises one or more other containers comprising one or 
more of the following: wash reagents and reagents capable of detecting the presence of 
bound antibodies. 

15 Examples of detection reagents include, but are not limited to, labeled secondary 

antibodies, or in the alternative, if the primary antibody is labeled, the chromophoric, 
enzymatic, or antibody binding reagents that are capable of reacting with the labeled 
antibody. The compartmentalized kit may be as described above for nucleic acid probe 
kits. One skilled in the art will readily recognize that the antibodies described in the 

20 present invention can readily be incorporated into one of the established kit formats that 

are well known in the art. 

VI. Isolation of Compounds That Interact With Protein Kinases 

The present invention also relates to a method of detecting a compound capable of 
binding to a protein kinase of the invention, comprising incubating the compound with a 
25 kinase of the invention and detecting the presence of the compound bound to the kinase. 

The compound may be present within a complex mixture, for example, serum, body fluid, 
or cell extracts. 

The present invention also relates to a method of detecting an agonist or antagonist 
of kinase activity or kinase binding partner activity comprising incubating cells that 
30 produce a kinase of the invention in the presence of a compound and detecting changes in 
the level of kinase activity or kinase binding partner activity. The compounds thus 
identified would produce a change in activity indicative of the presence of the compound. 
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The compound may be present within a complex mixture, for example, serum, body fluid, 
or cell extracts. Once the compound is identified it can be isolated using techniques well 
known in the art. 

The present invention also encompasses a method of agonizing (stimulating) or 
5 antagonizing kinase associated activity in a mammal comprising administering to said 

mammal an agonist or antagonist to a kinase of the invention in an amount sufficient to 
effect said agonism or antagonism. A method of treating diseases in a mammal with an 
agonist or antagonist of kinase activity comprising administering the agonist or antagonist 
to a mammal in an amount sufficient to agonize or antagonize kinase associated functions 

10 is also encompassed in the present application. 

In an effort to discover novel treatments for diseases, biomedical researchers and 
chemists have designed, synthesized, and tested molecules that inhibit the function of 
protein kinases. Some small organic molecules form a class of compounds that modulate 
the function of protein kinases. Examples of molecules that have been reported to inhibit 

15 the function of protein kinases include, but are not limited to, bis monocyclic, bicyclic or 

heterocyclic aryl compounds (PCT WO 92/20642, published November 26, 1992 by 
Maguire et al.) 9 vinylene-azaindole derivatives (PCT WO 94/14808, published July 7, 
1994 by Ballinari et ai), l-cyclopropyl-4-pyridyl-quinolones (U.S. Patent No. 5,330,992), 
styryl compounds (U.S. Patent No. 5,217,999), styryl-substituted pyridyl compounds (U.S. 

20 Patent No. 5,302,606), certain quinazoline derivatives (EP Application No. 0 566 266 Al), 

seleoindoles and selenides (PCT WO 94/03427, published February 1 7, 1 994 by Denny et 
al\ tricyclic polyhydroxylic compounds (PCT WO 92/21660, published December 10, 
1992 by Dow), and benzylphosphonic acid compounds (PCT WO 91/15495, published 
October 17, 1991 by Dow et al). 

25 Compounds that can traverse cell membranes and are resistant to acid hydrolysis 

are potentially advantageous as therapeutics as they can become highly bioavailable after 
being administered orally to patients. However, many of these protein kinase inhibitors 
only weakly inhibit the function of protein kinases. In addition, many inhibit a variety of 
protein kinases and will cause multiple side-effects as therapeutics for diseases. 

30 Some indolinone compounds, however, form classes of acid resistant and 

membrane permeable organic molecules. WO 96/22976 (published August 1, 1996 by 
Ballinari et ai) describes hydrosoluble indolinone compounds that harbor tetralin, 
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naphthalene, quinoline, and indole substituents fused to the oxindole ring. These bicyclic 
substituents are in turn substituted with polar moieties including hydroxylated alkyl, 
phosphate, and ether moieties. U.S. Patent Application Serial Nos. 08/702,232, filed 
August 23, 1996, entitled "Indolinone Combinatorial Libraries and Related Products and 
5 Methods for the Treatment of Disease" by Tang et al (Lyon & Lyon Docket No. 221/1 87) 
and 08/485,323, filed June 7, 1995, entitled "Benzylidene-Z-Indoline Compounds for the 
Treatment of Disease" by Tang et al (Lyon & Lyon Docket No. 223/298) and 
International Patent Publication WO 96/22976, published August 1, 1996 by Ballinari et 
al, all of which are incorporated herein by reference in their entirety, including any 

1 0 drawings, describe indolinone chemical libraries of indolinone compounds harboring other 
bicyclic moieties as well as monocyclic moieties fused to the oxindole ring. Applications 
08/702,232, filed August 23, 1996, entitled "Indolinone Combinatorial Libraries and 
Related Products and Methods for the Treatment of Disease" by Tang et al (Lyon & Lyon 
Docket No. 221/187), 08/485,323, filed June 7, 1995, entitled "Benzylidene-Z-Indoline 

1 5 Compounds for the Treatment of Disease" by Tang et al (Lyon & Lyon Docket No. 

223/298), and WO 96/22976, published August 1, 1996 by Ballinari et al teach methods 
of indolinone synthesis, methods of testing the biological activity of indolinone 
compounds in cells, and inhibition patterns of indolinone derivatives. 

Other examples of substances capable of modulating kinase activity include, but 

20 are not limited to, tyrphostins, quinazolines, quinoxolines, and quinolines. The 

quinazolines, tyrphostins, quinolines, and quinoxolines referred to above include well 
known compounds such as those described in the literature. For example, representative 
publications describing quinazolines include Barker et al, EPO Publication No. 0 520 722 
Al ; Jones et al, U.S. Patent No.4,447,608; Kabbe et al, U.S. Patent No. 4,757,072; Kaul 

25 and Vougioukas, U.S. Patent No. 5, 316,553; Kreighbaum and Comer, U.S. Patent No. 
4,343,940; Pegg and Wardleworth, EPO Publication No. 0 562 734 Al ; Barker et al, 
Proc. of Am. Assoc. for Cancer Research 32:327 (1991); Bertino, J.R., Cancer Research 
3:293-304 (1979); Bertino, J.R., Cancer Research 9(2 part l):293-304 (1979); Curtin et 
al, Br. J. Cancer 53:361-368 (1986); Femandes et al, Cancer Research 43:1 117-1 123 

30 (1983); Ferris et al J. Org. Chem. 44(2): 173- 178; Fry et al, Science 265:1093-1095 

(1994); Jackman et al, Cancer Research 51 :5579-5586 (1981); Jones et al J. Med. Chem. 
29(6): 1 1 14-1118; Lee and Skibo, Biochemistry 26(23):7355-7362 (1987); Lemus et al, J. 
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Org. Chem. 54:35 1 1 -35 1 8 (1 989); Ley and Seng, Synthesis 1 975 :41 5-522 ( 1 975); 
Maxwell et al, Magnetic Resonance in Medicine 17:189-196 (1991); Mini et al, Cancer 
Research 45:325-330 (1985); Phillips and Castle, J. Heterocyclic Chem. 17(1 9): 1489-1 596 
(1980); Reece et al, Cancer Research 47(1 1):2996-2999 (1977); Sculier et al, Cancer 
5 Immunol, and Immunother. 23:A65 (1986); Sikora et al, Cancer Letters 23:289-295 
(1984); Sikora et al, Analytical Biochem. 172:344-355 (1988); all of which are 
incorporated herein by reference in their entirety, including any drawings. 

Quinoxaline is described in Kaul and Vougioukas, U.S. Patent No. 5,316,553, 
incorporated herein by reference in its entirety, including any drawings. 

10 Quinolines are described in Dolle et al, J. Med. Chem. 37:2627-2629 (1994); 

MaGuire, J. Med. Chem. 37:2129-2131 (1994); Burke et al, J. Med. Chem. 36:425-432 
(1993); and Burke et al BioOrganic Med. Chem. Letters 2:1771-1774 (1992), all of which 
are incorporated by reference in their entirety, including any drawings. 

Tyrphostins are described in Allen et al, Clin. Exp. Immunol. 91 : 141 -156 (1 993); 

15 Anafi et al, Blood 82:12:3524-3529 (1993); Baker et al, J. Cell Sci. 102:543-555 (1 992); 

Bilder et al. Amer. Physiol. Soc. pp. 6363-6143:C721-C730 (1991); Brunton et al, 
Proceedings of Amer. Assoc. Cancer Rsch. 33:558 (1992); Bryckaert et al, Experimental 
Cell Research 199:255-261 (1992); Done et al. J. Leukocyte Biology 53:53-60 (1993): 
Dong etal. J. Immunol. 1 51 fS~>:27 1 7-2724 (1993); Gazit et al, J. Med. Chem. 32:2344- 

20 2352 (1989); Gazit et al, " J. Med. Chem. 36:3556-3564 (1993); Kaur et al, Anti-Cancer 
Drugs 5:213-222 (1994); Kaur et al, King et al, Biochem. J. 275:41 3-41 8 (1991); Kuo et 
al, Cancer Letters 74:197-202 (1993); Levitzki, A., The FASEB J. 6:3275-3282 (1992); 
Lyall et al. J. Biol. Chem. 264:14503-14509 (1989); Peterson et al. The Prostate 22:335- 
345 (1993); Pillemer et al, Int. J. Cancer 50:80-85 (1992); Posner et al. Molecular 

25 Pharmacology 45:673-683 (1993); Rendu et al, Biol. Pharmacology 44(5):881-888 

(1992); Sauro and Thomas, Life Sciences 53:371-376 (1993); Sauro and Thomas, J, 
Pharm. and Experimental Therapeutics 267(3): 1 19-1 125 (1993); Wolbring et al, J. Biol. 
Chem. 269(36):22470-22472 (1994); and Yoneda et al, Cancer Research 51 :4430-4435 
(1991); all of which are incorporated herein by reference in their entirety, including any 

30 drawings. 
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Other compounds that could be used as modulators include oxindolinones such as 
those described in U.S. patent application Serial No. 08/702,232 filed August 23, 1996, 
incorporated herein by reference in its entirety, including any drawings. 
VII. Biological Significance, Applications and Clinical Relevance of Novel Protein 
5 Kinases 

For each protein kinase in this application, we provide a classification of the 
protein class and family to which it belongs, a summary of non-cataltyic protein motifs, a 
profile of its expression in several hundred tissue and cell sources, and a chromosomal 
location. This information can be used to suggest potential function, regulation or 
10 therapeutic utility for each of the proteins. 

The kinase classification and protein domains often reflect pathways, cellular roles, 
or mechanisms of up- or down-stream regulation. Also disease-relevant genes often occur 
in families of related genes. For example if one member of a kinase family functions as an 
oncogene, a tumor suppressor, or has been found to be disrupted in an immune, 
15 neurologic, cardiovascular, or metabolic disorder, frequently other family members may 

play a related role. 

The expression analysis organizes kinases into groups that are transcriptionally 
upregulated in tumors and those that are more restricted to specific tumor types such as 
melanoma or prostate. This analysis also identifies genes that are regulated in a cell cycle 

20 dependent manner, and are therefore likely to be involved in maintaining cell cycle 

checkpoints, entry, progression, or exit from mitosis, oversee DNA repair, or are involved 
in cell proliferation and genome stability. Expression data also can identify genes 
expressed in endothelial sources or other tissues that suggest a role in angiogenesis, 
thereby implicating them as targets for control of diseases that have an angiogenic 

25 component, such as cancer, endometriosis, retinopathy and macular degeneration, and 

various ischemic or vascular pathologies. A proteins' role in cell survival can also be 
suggested based on restricted expression in cells subjected to external stress such as 
oxidative damage, hypoxia, drugs such as cisplatinum, or irradiation. Metastases- 
associated genes can be implicated when expression is restricted to invading regions of a 

30 tumor, or is only seen in local or distant metastases compared to the primary tumor, or 

when a gene is upregulated during cell culture models of invasion, migration, or motility. 
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Chromosomal location can identify candidate targets for a tumor amplicon or a 
tumor-suppressor locus. Summaries of prevelant tumor amplicons are available in the 
literature, and can identify tumor types to experimentally be confirmed to contain 
amplified copies of a kinase gene which localizes to an adjacent region. 
5 Based on these criteria several kinases immediately stand out as being of potential 

therapeutic relevance. The protein kinases can be divided into the following disease- 
relevant categories (nucleotide Seq ID #s in parentheses): 

Tumor associated: Mok (SEQ ID NO:NO:57), EPK2, AA3 16804 (SEQ ID 
NO:l 1), AA435956 (SEQ ID NO:NO:48), AA278842 (SEQ ID NO:88), AA599286 (SEQ 
10 ID NO:89), AA826850 (SEQ ID NO:3), HRI (SEQ ID NO:73), MLK4 AA232253 (SEQ 
ID NO:82), AA883975 SGK 235 (SEQ ID NO:95), AA31 1714 (SEQ ID NO:101), 
MPSK1 (SEQ ID NO:l 10), R19609 (Seq ID11 1), AA383293 (SEQ ID NO:26). 

Prostate-specific: AA234451 (SEQ ID NO:47), TSK4 (SEQ ID NO:93), RIP4 
(SEQ ID NO:84), KJAA0965 (SEQ ID NO:8). 
] 5 Oncogenic or proliferation associated: K1AA0781 (SEQ ID NO:38), AA789239 

(SEQ ID NO:52), CCRK (SEQ ID NO:54), CLK4 (SEQ ID NO:55), H85389 (SEQ ID 
NO:97). 

Neuronal restricted: CAMKKB (SEQ ID NO:66) 

Hematopoietic expressed: PTK9L (SEQ ID NO:22), DRAK2 (SEQ ID NO:29), 
20 AI025291 (SEQ ID NO:94) 

Angiogenic or endothelial expressed: DRAK1 (SEQ ID NO:31), MAK-V (SEQ 
ID NO:40), TRAD (SEQ ID NO:44), MOK (SEQ ID NO:57), AA08847 (SEQ ID NO:78), 
HGP_66444466 (SEQ ID NO:79), RSK4 (SEQ ID NO: 16). 

Cell cycle regulated: AA454060 (SEQ ID NO:45), KIAA0999 (Mitotic - SEQ ID 
25 NO:32), AA579641 (Mitotic - SEQ ID NO:60), AA305176 (Mitotic - SEQ ID NO:6), 
AA018361 (SI phase - SEQ ED NO: 100). 
VEX Transgenic Animals. 

A variety of methods are available for the production of transgenic animals 
associated with this invention. DNA can be injected into the pronucleus of a fertilized egg 
30 before fusion of the male and female pronuclei, or injected into the nucleus of an 

embryonic cell (e.g., the nucleus of a two-cell embryo) following the initiation of cell 
division (Brinster et al. , Proc. Nat. Acad. Sci. USA 82: 4438-4442, 1985). Embryos can 
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be infected with viruses, especially retroviruses, modified to carry inorganic-ion receptor 
nucleotide sequences of the invention. 

Pluripotent stem cells derived from the inner cell mass of the embryo and 
stabilized in culture can be manipulated in culture to incorporate nucleotide sequences of 
5 the invention. A transgenic animal can be produced from such cells through implantation 

into a blastocyst that is implanted into a foster mother and allowed to come to term. 
Animals suitable for transgenic experiments can be obtained from standard commercial 
sources such as Charles River (Wilmington, MA), Taconic (Germantown, NY), Harlan 
Sprague Dawley (Indianapolis, IN), etc. 

10 The procedures for manipulation of the rodent embryo and for microinjection of 

DNA into the pronucleus of the zygote are well known to those of ordinary skill in the art 
(Hogan et al. 9 supra). Microinjection procedures for fish, amphibian eggs and birds are 
detailed in Houdebine and Chourrout (Experientia 47: 897-905, 1991). Other procedures 
for introduction of DNA into tissues of animals are described in U.S. Patent No., 

15 4,945,050 (Sanford el ai, July 30, 1990). 

By way of example only, to prepare a transgenic mouse, female mice are induced 
to superovulate. Females are placed with males, and the mated females are sacrificed by 
CO2 asphyxiation or cervical dislocation and embryos are recovered from excised 
oviducts. Surrounding cumulus cells are removed. Pronuclear embryos are then washed 

20 and stored until the time of injection. Randomly cycling adult female mice are paired with 

vasectomized males. Recipient females are mated at the same time as donor females. 
Embryos then are transferred surgically. The procedure for generating transgenic rats is 
similar to that of mice (Hammer et ai, Cell 63:1099-1 112, 1990). 

Methods for the culturing of embryonic stem (ES) cells and the subsequent 

25 production of transgenic animals by the introduction of DNA into ES cells using methods 

such as electroporation, calcium phosphate/DNA precipitation and direct injection also are 
well known to those of ordinary skill in the art (Teratocarcinomas and Embryonic Stem 
Cells, A Practical Approach, EJ. Robertson, ed., IRL Press, 1987). 

In cases involving random gene integration, a clone containing the sequence(s) of 

30 the invention is co-transfected with a gene encoding resistance. Alternatively, the gene 

encoding neomycin resistance is physically linked to the sequence(s) of the invention. 
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Transfection and isolation of desired clones are canned out by any one of several methods 
well known to those of ordinary skill in the art (E.J. Robertson, supra). 

DNA molecules introduced into ES cells can also be integrated into the 
chromosome through the process of homologous recombination (Capecchi, Science 244: 
5 1288-1292, 1989). Methods for positive selection of the recombination event (i.e., neo 

resistance) and dual positive-negative selection (i.e., neo resistance and gancyclovir 
resistance) and the subsequent identification of the desired clones by PCR have been 
described by Capecchi, supra and Joyner et al. (Nature 338: 153-156, 1989), the teachings 
of which are incorporated herein in their entirety including any drawings. The final phase 

1 0 of the procedure is to inject targeted ES cells into blastocysts and to transfer the 

blastocysts into pseudopregnant females. The resulting chimeric animals are bred and the 
offspring are analyzed by Southern blotting to identify individuals that carry the transgene. 
Procedures for the production of non-rodent mammals and other animals have been 
discussed by others (Houdebine and Chourrout, supra; Purse! et al, Science 244:1281- 

15 1288, 1989;andSimmse/fl/.,Bio/Technology6:179-183, 1988). 

Thus, the invention provides transgenic, nonhuman mammals containing a 
transgene encoding a kinase of the invention or a gene effecting the expression of the 
kinase. Such transgenic nonhuman mammals are particularly useful as an in vivo test 
system for studying the effects of introduction of a kinase, or regulating the expression of 

20 a kinase (i.e., through the introduction of additional genes, antisense nucleic acids, or 

ribozymes). 

A "transgenic animal" is an animal having cells that contain DNA which has been 
artificially inserted into a cell, which DNA becomes part of the genome of the animal 
which develops from that cell. Preferred transgenic animals are primates, mice, rats, 
25 cows, pigs, horses, goats, sheep, dogs and cats. The transgenic DNA may encode human 

STE20-related kinases. Native expression in an animal may be reduced by providing an 
amount of anti-sense RNA or DNA effective to reduce expression of the receptor. 

IX. Gene Therapy 

30 Protein kinases of the invention, or their genetic sequences will also be useful in 

gene therapy (reviewed in Miller, Nature 357:455-460, 1992). Miller states that advances 
have resulted in practical approaches to human gene therapy that have demonstrated 
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positive initial results. The basic science of gene therapy is described in Mulligan 
(Science 260:926-931, 1993). 

In one preferred embodiment, an expression vector containing protein kinase 
coding sequence is inserted into cells, the cells are grown in vitro, and then are infused in 
5 large numbers into patients. In another preferred embodiment, a DNA segment containing 

a promoter of choice (for example a strong promoter) is transferred into cells containing 
an endogenous gene encoding kinases of the invention in such a manner that the promoter 
segment enhances expression of the endogenous kinase gene (for example, the promoter 
segment is transferred to the cell such that it becomes directly linked to the endogenous 

10 kinase gene). 

The gene therapy may involve the use of an adenovirus containing kinase cDNA 
targeted to a tumor, systemic kinase increase by implantation of engineered cells, injection 
with kinase-encoding virus, or injection of naked kinase DNA into appropriate tissues. 
Target cell populations may be modified by introducing altered forms of one or 

15 more components of the protein complexes in order to modulate the activity of such 

complexes. For example, by reducing or inhibiting a complex component activity within 
target cells, an abnormal signal transduction event(s) leading to a condition may be 
decreased, inhibited, or reversed. Deletion or missense mutants of a component, that 
retain the ability to interact with other components of the protein complexes but cannot 

20 function in signal transduction may be used to inhibit an abnormal, deleterious signal 

transduction event. 

Expression vectors derived from viruses such as retroviruses, vaccinia virus, 
adenovirus, adeno-associated virus, herpes viruses, several RNA viruses, or bovine 
papilloma vims, may be used for delivery of nucleotide sequences (e.g., cDNA) encoding 

25 recombinant kinase of the invention protein into the targeted cell population (e.g., tumor 
cells). Methods which are well known to those skilled in the art can be used to construct 
recombinant viral vectors containing coding sequences (Maniatis el al, Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, N.Y., 1989; Ausubel et 
al, Current Protocols in Molecular Biology, Greene Publishing Associates and Wiley 

30 Interscience, N.Y., 1989). Alternatively, recombinant nucleic acid molecules encoding 

protein sequences can be used as naked DNA or in a reconstituted system e.g., liposomes 
or other lipid systems for delivery to target cells (e.g., Feigner et al., Nature 337:387-8, 
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1989). Several other methods for the direct transfer of plasmid DNA into cells exist for 
use in human gene therapy and involve targeting the DNA to receptors on cells by 
complexing the plasmid DNA to proteins (Miller, supra). 

In its simplest form, gene transfer can be performed by simply injecting minute 
5 amounts of DNA into the nucleus of a cell, through a process of microinjection (Capecchi, 

Cell 22:479-88, 1980). Once recombinant genes are introduced into a cell, they can be 
recognized by the cell's normal mechanisms for transcription and translation, and a gene 
product will be expressed. Other methods have also been attempted for introducing DNA 
into larger numbers of cells. These methods include: transfection, wherein DNA is 

10 precipitated with CaP0 4 and taken into cells by pinocytosis (Chen et aL, Mol. Cell Biol. 

7:2745-52, 1987); electroporation, wherein cells are exposed to large voltage pulses to 
introduce holes into the membrane (Chu et al. 9 Nucleic Acids Res. 15:131 1-26, 1987); 
lipofection/liposome fusion, wherein DNA is packaged into lipophilic vesicles which fuse 
with a target cell (Feigner et ai., Proc. Natl. Acad. Sci. USA. 84:7413-7417, 1987); and 

1 5 particle bombardment using DNA bound to small projectiles (Yang et ai , Proc. Natl. 

Acad. Sci. 87:9568-9572, 1990). Another method for introducing DNA into cells is to 
couple the DNA to chemically modified proteins. 

It has also been shown that adenovirus proteins are capable of destabilizing 
endosomes and enhancing the uptake of DNA into cells. The admixture of adenovirus to 

20 solutions containing DNA complexes, or the binding of DNA to polylysine covalently 

attached to adenovirus using protein crosslinking agents substantially improves the uptake 
and expression of the recombinant gene (Curiel et ai, Am. J. Respir. Cell. Mol. Biol., 
6:247-52, 1992). 

As used herein "gene transfer" means the process of introducing a foreign nucleic 
25 acid molecule into a cell. Gene transfer is commonly performed to enable the expression 

of a particular product encoded by the gene. The product may include a protein, 
polypeptide, anti-sense DNA or RNA, or enzymatically active RNA. Gene transfer can be 
performed in cultured cells or by direct administration into animals. Generally gene 
transfer involves the process of nucleic acid contact with a target cell by non-specific or 
30 receptor mediated interactions, uptake of nucleic acid into the cell through the membrane 
or by endocytosis, and release of nucleic acid into the cytoplasm from the plasma 
membrane or endosome. Expression may require, in addition, movement of the nucleic 
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acid into the nucleus of the cell and binding to appropriate nuclear factors for 
transcription. 

As used herein "gene therapy" is a form of gene transfer and is included within the 
definition of gene transfer as used herein and specifically refers to gene transfer to express 
5 a therapeutic product from a cell in vivo or in vitro. Gene transfer can be performed ex 

vivo on cells which are then transplanted into a patient, or can be performed by direct 
administration of the nucleic acid or nucleic acid-protein complex into the patient. 

In another preferred embodiment, a vector having nucleic acid sequences encoding 
a protein kinase polypeptide of the invention is provided in which the nucleic acid 
10 sequence is expressed only in specific tissue. Methods of achieving tissue-specific gene 

expression are set forth in International Publication No. WO 93/09236, filed November 3, 
1992 and published May 13, 1993. 

In all of the preceding vectors set forth above, a further aspect of the invention is 
that the nucleic acid sequence contained in the vector may include additions, deletions or 
15 modifications to some or all of the sequence of the nucleic acid, as defined above. 

In another preferred embodiment, a method of gene replacement is set forth. 
"Gene replacement" as used herein means supplying a nucleic acid sequence which is 
capable of being expressed in vivo in an animal and thereby providing or augmenting the 
function of an endogenous gene that is missing or defective in the animal. 
20 X. Administration of Substances 

Methods of determining the dosages of compounds to be administered to a patient 
and modes of administering compounds to an organism are disclosed in U.S. Application 
Serial No. 08/702,282, filed August 23, 1996 and International patent publication number 
WO 96/22976, published August 1 1996, both of which are incoiporated herein by 
25 reference in their entirety, including any drawings, figures, or tables. Those skilled in the 

art will appreciate that such descriptions are applicable to the present invention and can be 
easily adapted to it. 

The proper dosage depends on various factors such as the type of disease being 
treated, the particular composition being used, and the size and physiological condition of 
30 the patient. Therapeutically effective doses for the compounds described herein can be 

estimated initially from cell culture and animal models. For example, a dose can be 
formulated in animal models to achieve a circulating concentration range that initially 
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takes into account the IC50 as determined in cell culture assays. The animal model data 
can be used to more accurately determine useful doses in humans. 

Plasma half-life and biodistribution of the drug and metabolites in the plasma, 
tumors, and major organs can be also be determined to facilitate the selection of drugs 
5 most appropriate to inhibit a disorder. Such measurements can be carried out. For 

example, HPLC analysis can be performed on the plasma of animals treated with the drug 
and the location of radiolabeled compounds can be determined using detection methods 
such as X-ray, CAT scan, and MRL Compounds that show potent inhibitory activity in 
the screening assays, but have poor pharmacokinetic characteristics, can be optimized by 

10 altering the chemical structure and retesting. In this regard, compounds displaying good 

pharmacokinetic characteristics can be used as a model. 

Toxicity studies can also be carried out by measuring the blood cell composition. 
For example, toxicity studies can be carried out in a suitable animal model as follows: 1) 
the compound is administered to mice (an untreated control mouse should also be used); 

1 5 2) blood samples are periodically obtained via the tail vein from one mouse in each 

treatment group; and 3) the samples are analyzed for red and white blood cell counts, 
blood cell composition, and the percent of lymphocytes versus polymorphonuclear cells. 
A comparison of results for each dosing regime with the controls indicates if toxicity is 
present. 

20 At the termination of each toxicity study, further studies can be carried out by 

sacrificing the animals (preferably, in accordance with the American Veterinary Medical 
Association guidelines Report of the American Veterinary Medical Assoc. Panel on 
Euthanasia, Journal of American Veterinary Medical Assoc., 202:229-249, 1993). 
Representative animals from each treatment group can then be examined by gross 

25 necropsy for immediate evidence of metastasis, unusual illness, or toxicity. Gross 

abnormalities in tissue are noted, and tissues are examined histologically. Compounds 
causing a reduction in body weight or blood components are less preferred, as are 
compounds having an adverse effect on major organs. In general, the greater the adverse 
effect the less preferred the compound. 
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For the treatment of cancers the expected daily dose of a hydrophobic 
pharmaceutical agent is between 1 to 500 mg/day, preferably 1 to 250 mg/day, and most 
preferably 1 to 50 mg/day. Drugs can be delivered less frequently provided plasma levels 
of the active moiety are sufficient to maintain therapeutic effectiveness. 

Plasma levels should reflect the potency of the drug. Generally, the more potent the 
compound the lower the plasma levels necessary to achieve efficacy. 

EXAMPLES 

The examples below are not limiting and are merely representative of various 
aspects and features of the present invention. The examples below demonstrate the 
isolation and characterization of the protein kinases of the invention. 

EXAMPLE 1 : Isolation of cDNA clones Encoding Novel Mammalian Protein Kinases 
Materials and Methods Identification from cDNA databases and isolation of clones 
encoding novel protein kinases 

Novel kinases were identified from the public EST databases using a Hidden 
Markov model, abbreviated HMM (Kxogh, A., Brown, M., Mian, 1. S., Sjolander, K., and 
Haussler, D. 1994. Hidden Markov models in computational biology: Applications to 
protein modeling. J. Mol Biol, 235:1501-1531). The model was built with 70 
mammalian and yeast kinase catalytic domain sequences. These sequences were chosen 
from a comprehensive collection of kinases such that no two sequences had more than 
50% sequence identity. ESTs were translated in six open reading frames and were 
searched against the model. ESTs that had a score of at least 10 against the HMM were 
then masked for repetitive sequences and vectors and were clustered using MSA. The 
resulting contigs were searched against known kinases to identify EST clones that encode 
novel kinases. 

Approximately 40% of the ESTs encoding potentially novel kinases did not 
correspond to the correct EST upon sequence analysis. Most of these discrepancies were 
resolved by ordering additional clones, however, 14 remained unavailable. These 14 ESTs 
were amplified from a variety of single-stranded cDNA sources with primers derived from 
the corresponding EST entry as shown on Table 5. The PCR product was subcloned into a 
bluescript vector, digested to confirm the presence of a correct size insert and sequenced. 
Full sequencing of EST and PCR was carried out using a cycle sequencing Big-dye kit 
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with AmpliTaq DNA Polymerase, FS (ABI, Foster City, CA). Sequencing reaction 
products were run on an ABI Prism 377 DNA Sequencer, 

Table 5: Primers used to clone PCR products corresponding to novel kinases 





ID# 


ID# 


Parent 


5' primer 


3* primer 


sp 


na 


aa 


Sequence 


Sequence* 


Sequence* 


H 


33 


153 


2R22-5-11 


GAGATCGRNTTYAARGA 
RTTYGA 


TGTCACNCCNAGNSWCCAN 
AYRTT 


M 


81 


200 


5R57J0_2_ 
m TESK2_m 


GCTGCTGGACAGTGACT 
TGTATTT 


GAAAGCAAAGCCTTCACAC 
CTT 


H 


67 


187 


5R69J7_2_h 


CTCTCACCTCAGGAACT 
GG 


GCTTGCGGATCTTCTCA 


H 


46 


166 


SGK309_h 


GACATCCTGCCGGCCAA 
CTACG 


CGGCCCTGGAGCTGCATCA 
CTA 


M 


67 


228 


5R72_16_2_h 


TGCGCGACACCATTGAC 
CAG 


CTCAGGGCTTACATACAGA 
G 


H 


45 


165 


5R72_8_2_h 


AAAGGAGAACTACATTT 
TGAAAAT 


CTTCATCATCTCTAATACAT 
TGGTTGG 


H 


41 


161 


Z36720 


CAAATTAAGATCATTGA 
CTTTGGG 


GGAAACAAAGTCCTTGGCC 
TC 


H 


115 


234 


AL031652- 
Pak6 


GTGGACATCTGGTCCCT 
CG 


GTAGGTCCTTCACTCTTGG 
AG 



• degenerate oligonucleotide residue designation: 
N= A,C,G ot T 



R= A or G 
Y= C or T 
S = C or G 
W= A or T 

Full-length sequence extension of protein kinases using cDNA and genomic databases 

Extension of partial cDNA sequences to encompass the full-length open-reading 
frame was carried out by iterative blastn searching of the cDNA databases listed in Table 
6. All blastn searches were conducted using a blosum62 matrix, a penalty for a nucleotide 
mismatch of -3 and reward for a nucleotide match of 1 . The gapped blast algortihm is 
described in: (Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui 
Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and 
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PS1-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 
25:3389-3402). 

Table 6. Databases used for cDNA-based sequence extensions 



Database 




LifeGold templates 


Feb 2000 


LifeGold compseqs 


Feb 2000 


LifeGold compseqs 


Mar 2000 


LifeGold compseqs 


Apr 2000 


LifeGold fl 


Feb 2000 


LifeGold flft 


Apr 2000 


NCBI human Ests 


May 2000 


NCBI murine Ests 


May 2000 


NCBI nonredundant 


May 2000 



5 

Extension of partial cDNA sequences to encompass the full-length open-reading 
frame was also carried out by iterative searches of genomic databases. Three methods 
were used. The first method made use of the Smith- Waterman algorithm to carry out 
protein-protein searches of the closest homologue or orthologue to the partial kinase. The 

10 target databases consisted of Genescan and open-reading frame (ORF) predictions of all 

human genomic sequence derived from the human genome project (HGP) as well as from 
Celera. The complete set of genomic databases searched is shown in Table 7 below. 
Genomic sequences encoding potential extensions were further assessed by blastp analysis 
against the NCBI nonredundant to confirm the novelty of the hit. The extending genomic 

15 sequences were incorporated into the cDNA sequence after removal of potential introns 

using the Seqman program from DNAStar. The default parameters used for Smith- 
Waterman searches were as shown next. Matrix: blosum 62; gap-opening penalty: 12; gap 
extension penalty: 2. Genescan predictions were made using the Genescan program as 
detailed in (Chris Burge and Sam Karlin "Prediction of Complete Gene Structures in 

20 Human Genomic DNA", JMB (1997) 268(l):78-94). ORF predictions from genomic 

DNA were made using a standard 6-frame translation. 
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The second method for genomic sequence-based extensions made use of tBlastn 
searches of the homologue or orthologue to the partial kinase against the cDNA databases 
listed in Table 7. The recognition of significant hits in these databases made possible to 
identify bridging partial cDNA clones. The iterative application of the two methods made 
5 possible the assemblage of the virtual full-length sequence for a large number of the 

kinases presented in this application. All tblastn searches were conducted using a 
blosum62 matrix, a penalty for a nucleotide mismatch of -3 and reward for a nucleotide 
match of 1 . 

The last method for defining cDNA extensions from genomic sequence used 
10 iterative searches of genomic databases through the Genescan program to predict exon 

splicing and the Genewise program (http://www.sanger.ac.uk/SoftwareAVise2/ ) to predict 
potential ORFs based on homology to the closest orthologue/homologue. 
Table 7. Databases used for genomic-based sequence extensions 



Database 


Number of entries 


Database Date 


Celera v. 1-5 . 


5,306,158 


Jan 19/00 


Celera v. 6-10 


4,209,980 


Mar 24/00 


Celera v. 11-14 


7,222,425 


Apr 24/00 


Celera v. 1 5 


243,044 


May 14/00 


HGP all Genescan 


25,885 


Apr 04/00 


HGP; Phase 0 


4,944 


May 04/00 


HGP; Phase 1 


28,478 


May 05/00 


HGP; Phase 2 


1,508 


May 04/00 


HGP; Phase 3 


9,971 


May 05/00 



Virtual Extensions 

Human AA826850 (SEQ ID NO: 3, SEQ ID NO: 1 24) 
Blastn analysis of the partial AA826850 sequence revealed an extension to 
encompass the complete ORF in the Lncyte EST 238299.1 . A frame-shift correction at 
position 595 of this EST (marked by X in NA sequence) generated an uninterrupted ORP. 
Human AA960957 (SEQ ID NO: 4, SEQ ID NO: 125) 



15 



20 
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Since the initial filing of this application, the partial AA960957 sequence appeared 
in the public database as the full-length gene for a protein kinase encoded by a gene that 
maps adjacent to the eve (AJ250839) (ellis-van creveld syndrome and weyers acrodental 
dysostosis) gene from 4pl6.1. 
5 Human 5R79-46-l_h (SEQ ID NO: 5, SEQ ID NO:126) 

Blastn analysis of the partial 5R79-46-1 sequence revealed an extension to 
encompass the complete ORF in the Incyte EST 463894.6. Since the initial filing of this 
application, the full-length virtual 5R79-46-1 appeared in the public database as the full- 
length gene for the TANK-binding kinase (TBK1) (Pomerantz,J.L. and Baltimore,D. 
10 (1999) EMBO J. 18 (23), 6694-6704). TBK1 participates in NF-kB activation through the 

formation of a signaling complex with TRAF2 and TANK. 

Human AA305176 (SEQ ID NO: 6, SEQ ID NO: 127) 

Blastn analysis of the partial AA3051 76 sequence revealed an extension to 
encompass the complete ORF in the Incyte EST 220937.1. 
15 Human AA256100 (SEQ ID NO: 8, SEQ ID NO:129) 

Blastn analysis of the partial AA256100 sequence revealed an extension to 
encompass the complete ORF through the assembly of three partial clones: Incyte EST 
480815.6, K1AA0965 (BAA76809) and AA256100. 

Human AA2 10825 (SEQ ID NO: 9, SEQ ID NO: 130) 
20 Blastn analysis of the partial AA21 0825 sequence revealed an extension to 

encompass the nearly complete ORF through the assembly of three partial clones: Incyte 
EST 014721.7, and the NCB1 EST's AW01 158 and AA210825. An insertion of two "N's" 
at positions 1915 and 1916 generated an uninterrupted ORF. Blastx analysis indicated the 
possibility of a start Met in the range of 400-450 nucleotides (i.e. compared to the closest 
25 homolog, human PKCmu (CAA53384.1). However, no Met was found in this region; 

rather ORF ends in an in-frame stop preceeded by the sequence 

"RGLLAPGDPPCPPPNPAPATPPSSRLPTELFSNFCDS'\ It is possible that part of the 
sequence covered by nucleotide positions 1-400 derived from AW01 158 comes from an 
intron, explaining the absence of a start Met. 
30 Human AA 127299 (SEQ ID NO: 10, SEQ ID NO: 131) 

No entries in the database extended this sequence. The 1684 bp insert of this EST 
contains a 1369 bp intron at the 3' end. Blastx and SW analysis of the 315 bp coding 
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region revealed homology to the extracatalytic C2 domain of PKC. This EST, may or may 
not encode a kinase. 

Human AA3 16804 (SEQ ID NO:l 1, SEQ ID NO: 132) 

Since the initial filing of this application, the partial AA3 16804 sequence appeared 
5 in the public database as the full-length gene for the PKC family protein kinase EPK2 or 

PKCnu (AB015982). 

Human H 1 9 1 02 (SEQ ID NO: 1 4, SEQ ID NO: 1 35) 

Genewise and Genescan analyses of the partial HI 91 02 sequence revealed an 
extension from the HGP phase 3 contig 3810672 to encompass the complete catalytic 

10 domain of this EST. Blastn analysis against the non-redundant database revealed that this 

gene is found in the cosmid AC005726 from chromosome 17. HI 91 02 may encode a dual 
catalytic kinase given the homology to S6 kinase. Analysis of genomic sequence upstream 
of the 5' end of HI 91 02 revealed a non-kinase gene oriented in the same polarity as 
H 1 9 1 02 suggestive of the start Met for H 1 91 02 being close to the 5 ' end of the H 1 9 1 02 

15 sequence. From this analysis it is deduced that the second catalytic domain of HI 91 02, if 

present, is most likely located within the 47334-185,215 bp region of the genomic 
sequence of AC005726. 

Human AA476563 (SEQ ID NO: 15, SEQ ID NO: 136) 

Since the initial filing of this application, the partial AA476563 sequence appeared 
20 in the public database as the full-length gene for the protein kinase RPS6KC1 

(NMJH2424) (Zhang, H. et al Genomics (1999) 61, 314-318), which is an S6 kinase 
mapping to 12ql2-ql3.1. 

Human AA626690 (SEQ ID NO: 1 6, SEQ ID NO: 1 37) 

Since the initial filing of this application, the partial AA626690 sequence appeared 
25 in the public database as the full-length gene for the protein kinase RPS6KA6 (AF1 84965) 

(Yntema,H.G et al (1999) Genomics 62, 332-343), an S6 kinase commonly deleted in 
patients with complex X-linked (Xq21.1 ) mental retardation. 
Human A121 5680 (SEQ ID NO: 1 7, SEQ ID NO: 1 38) 

Since the initial filing of this application, the partial AI2 15680 sequence appeared 
30 in the public database as the full-length gene encoding a hypothetical protein (AAD30182) 
from the locus AC006530.4 from chromosome 14. 

Human AA887783 (SEQ ID NO:21, SEQ ID NO:142) 
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Blastn analysis of the partial AA887783 sequence revealed an extension to 
encompass the nearly complete ORF through the assembly of three partial clones: Incyte 
415390R6 and the NCBI EST's AA887783 and N94726. Since the initial filing of this 
application, the nearly full-length virtual AA887783 sequence appeared in the public 
5 database as the full-length gene encoding SGK3 (AF1 69035), a serum- and 

glucocorticoid-induced protein kinase (Kobayashi,T. et a] (1999) Biochemical J. 344, 189- 
197. 

Human R47805 (SEQ ID NO:22, SEQ ID NO:143) 

A cDNA clone encoding the full-length ORF of R47805 was isolated using 
10 R47805 as a screening probe. A full-length form for R47805 has also appeared in the 
public database as 

PTK9L (NM_007284), an A6-related protein kinase. 
Human H60215 (SEQ ID NO:23, SEQ ID NO:144) 

Blastn analysis of the partial H60215 sequence revealed an extension to encompass 
15 the complete ORF in the public EST AI275726. This was confirmed through the full insert 

sequencing of this EST (2,310 bp) which corresponds to the sequence under SEQ ID 
NO: 144. 

A different stop codon was predicted for AI275726 compared to H60215 due to a 
single nucleotide insertion at position 1586 in AI275726. Evidence for the extra nucleotide 
20 comes from EST All 91 922. 

SGK324_h orthologue of W30246_m (SEQ ID NO:24 , SEQ ID NO: 145) 
Blastn, blastx and Smith-Waterman analyses of genomic databases revealed an 
extension to encompass the complete ORF corresponding to the human orthologue of 
murine W30246. Exons predicted from the following sequences were used for contig 
25 construction: Celera 17000189645083, 17000057549105 and 11000501939981; 

Incytel42404.1,HGP_7249119, Incyte 71 96489H1, Celera 1 1000501939981, 
17000028165594; Incyte 72491 19^3, Celera 17000035772368, 1 1000502081575 and 
17000140274329. The latter Celera sequence provides the N-terminus. 
Human AA383293 (SEQ ID NO:26, SEQ ID NO: 147) 
30 Blastn, blastx and Smith-Waterman analyses of genomic databases revealed an 

extension to encompass the complete ORF corresponding for AA383293. Exons predicted 
from the following sequences were used for contig construction: (numbers in parenthesis 
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refer to the aa sequence of the closest homolog (RU2S, NP_057440) used for the Smith- 
Waterman query): N-term from Incyte 6010175_2 (14-97), Incyte 6981981 (134-184) 
7596749 (186-232) Celera 17000020789545 (243-301) CAB75619.1 (310-341)-(56-145 
DCX homology) 6010175_2 , Celera 17000030058129 (241-262 DCX homology). 
5 Human AA021445 (SEQ ID NO:32, SEQ ID NO: 152) 

Blastn analysis revealed an extension to encompass the nearly complete ORF 
corresponding for AA021445. Contig reconstruction was as follows: nucleotides! -802 
from KIAA0999 (AB023216); nucleotides 803-4321 from full-insert sequence of 
AA021445. A pairwise alignment between the AA021445 and K1AA0999 revealed three 
10 inserts in the extracatalytic C-terminus of 48, 48 and 161 aminoacids. In addition, both 

AA021445 and KIAA0999 have 15 copies of a CAG repeat. Trinucleotide repeats are 
often found in genes that linked to neurodegenerative diseases. 
Human 2R22-55-1 (SEQ ID NO:33, SEQ ID NO: 153) 
Blastn analysis revealed an extension in the Incyte EST clone 321074.1 to 
1 5 encompass the complete ORF corresponding to 2R22-55-1 . 

Human orthologue of AA544838_m (SEQ ID NO:36, SEQ ID NO:l 56) 
tBlastn analysis identified the partial human K1AA0135 (U79240) clone as the 
human orthologue of murine AA544838. Blastn revealed an extension KlAA0135_h 
(U79240) to encompass the complete ORF. The full ORF was reconstructed from 
20 lncyte406786.5 5 KFZp430051 and KIAA0135 (U79240). 

Human orthologue of A1785735_m (SEQ ID NO:38, SEQ ED NO: 1 58) 
tBlastn analysis identified the partial human KJAA0781 (AB018324) clone as the 
human orthologue of murine AI785735. Blastn revealed an extension KIAA0135_h 
(U79240) to encompass the complete ORF. The full ORF was reconstructed from Incyte 
25 986123.37 KIAA0781 (AB018324). 

Human AA207220 (SEQ ED NO: 39, SEQ ID NO: 159) 
Blastn analysis revealed an extension to encompass the nearly complete ORF 
corresponding for AA021445. The full ORF was reconstructed from Incyte 402740.1 and 
AA207220. Frame corrections: deletion of 441 and 595 over Inc402740.1 seq based on 
30 blastx to keep frame open; two n insertions 940, 941 over AA207220 to keep frame open. 
Human AA426580 (SEQ ID NO:40, SEQ ED NO: 160) 
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Since the initial filing of this application, the partial AA426580 sequence appeared 
in the public database as the full-length gene encoding MAK-V (AJ271722) from 
chromosome 21q22.1. 

Human 5R79-54-1 (SEQ ED NO: 41, SEQ ID NO: 161) 
5 Genewise and Genescan analyses of the partial 5R79-54-1 sequence revealed an 

extension from genomic sequence to encode the full ORF for 5R79-54-1. 

Human orthologue of AA54201 5_m (SEQ ED NO: 42, SEQ ID NO: 1 62) 

tBlastn analysis identified KJAA1297 (AB037718). Blastn extended the 
K1AA1297 sequence to provide the C-terminus through the Incyte 224074.1 EST. The 
10 partial ORF consists of a dual catalytic domain flanked by 6 Ig domains and 2 fibronectin 

repeats. Based on homology to the bt drosophila protein (AAF59316.1), the human form 
of AA542015 is expected to be missing 16 Ig domains. 

Human R19772 (SEQ ID NO:44, SEQ ID NO: 164) 

The full-length ORF for R19772 was isolated by screening a cDNA library using a 
15 probe derived from R 19772. Since the initial filing of this application, the R 19772 

sequence appeared in the public database as the full-length gene encoding Trio (Duet) 
(AB01 1422). CDNA library screening revealed multiple iso forms for this gene which are 
summarized in the Table below. 

20 TableS. Isoforms for Rl 9772 



Kestrl Name 


Kestrl 
AA Acc # 


Isoform 
type 


Source 


Description* 


Trad (Duet) 


R19772 


B 


Skeletal 
muscle 


Deletion of K at 124 










Deletion of Q at 616 










Substitution of E for G at 
762 






C 


Skeletal 
muscle 


Deletion of K at 124 










Deletion of Q at 616 










Substitution of E for G at 
762 
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Deletion of 32 aa (160-191) 






D 


Lung tumor 


Deletion of Q at 616 










Deletion of 32 aa (160-191) 






E 


Lung tumor 


Deletion of Q at 616 










Deletion of 32 aa (160-191) 



* reference amino acid position are with respect to sequence of Trad (AB01 1422) 



Human AA435956 (SEQ ID NO:48, SEQ ID NO: 168) 
5 Blastn analysis revealed an extension to encompass the nearly complete catalytic 

region of AA435956. 5' end sequence extension was provided by genomic locus 
AC007242.3Ji (range 44880-43801). Based on blastx analysis, the extended sequence 
encodes is full-length at the C-terminus. 

Human AA397553 (SEQ ID NO: 51, SEQ ID NO: 1 7 1 ) 
10 Since the initial filing of this application, the partial AA397553 sequence appeared 

in the public database as the full-length gene encoding CRK7 (AF227198), a novel CDC2- 
related protein kinase that colocalizes with interchromatin granule clusters. 

Human AA789239 (SEQ ID NO: 52, SEQ ID NO: 1 72) 

Since the initial filing of this application, the partial AA789239 sequence appeared 
1 5 in the public database as the full-length gene encoding NK1AMRE (AF1 30372), a novel 

kinase deleted in human leukemia. 

Human AA631990 (SEQ ID NO:55, SEQ ED NO:I75) 
Blastn analysis revealed an extension to encompass the full-length ORF for 
AA631990. The full ORF was reconstructed from 253847.5 and AA631990 and 
20 AA207220. Frame corrections: delete 1 C at 1380, delete 2N's at 2033/2034. 
Human AA557536 (SEQ ID NO:56, SEQ ID NO: 176) 
Blastn analysis revealed an extension to encompass full-length ORF for 
AA557536. The full ORF was reconstructed from AA557536, celera 1 1000504061899 
and the Incyte 097089.1 EST. An 85bp intron was removed from AA557536. 
25 Human N34132 (SEQ ID NO: 63, SEQ ED NO: 183) 

Full sequencing of EST N34132 (1 .3 kb) confirmed that this cDNA encodes a 
novel NEK-subfamily kinase. Blast analysis against the EST database showed that four 
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EST sequences (AA283140, AA283140, AA28291 1 and N5301 1) extended the sequence 
ofN34132 at the 3' end to form a 2.31 kb contig. Blast analysis of the new contig against 
the nonredunat public database showed that the N34132 extended contig overlapped 
(100% identity) over 228 bp at its 3' end with human KIAA0344 (AB002342), a 5, 787 bp 

5 cDNA encoding a 1 246 aa polypeptide. The 5' 790 bp of the K1AA0344 cDNA (encoding 
the 58 N-terminal protein sequence) were found to be divergent with respect to the 
extended 2.32 kb N34132 contig. Evidence that the extended N34132 contig (2.3 lkb) and 
K1AA0344 (AB002342) belong to the same gene is the following. First, blast analysis of 
the nucleotide sequences for N34132 and KIAA0344 against the NRM database confirmed 

10 that these cDNA's are transcribed from the same genomic locus defined by two 

overlapping BACs (AC004765 and AC004803) from chromosome 12pl3.3. Second, full 
sequence determination of a PCR fragment amplified from single-stranded cDNA 
confirmed the junction between the extended N34132 contig and KlAA0344_h 
(AB002342). The 462 PCR product was amplified with primers 

1 5 CTCCTCAACAGACAGTGCAG (5 ' primer) and G ACATTCTACTACTCGGTCTC (3 ' 
primer) designed from the N34132 extended contig and K1AA0344 sequences, 
respectively. The region of N34132 containing the start Met was isolated by PCR from a 
testis cDNA library (Clontech). 

Human 5R69-17-2 (SEQ ID NO:67, SEQ ID NO: 187) 

20 The full-length ORF for 5R69-17-2 was isolated by screening a cDNA library 

using a probe derived from 5R69-1 7-2. 

Human H8581 1 (SEQ ID NO:68, SEQ ID NO:188) 

Tblastn, Smith- Waterman and blastn analyses using cDNA databases revealed an 
extension to encompass full-length ORF for H8581 1 . The full ORF was reconstructed 
25 from IncyteESTs 202971.8, 034583.3 and 034583.1 and public ESTs H8581 1 and 
AI570599. 

Human R43524 (SEQ ID NO:73, SEQ ID NO: 192) 

Blastn analysis revealed an extension to encompass the complete catalytic region 
and the C-terminus of R43524. Since the initial filing of this application, the partial 
30 R43524 sequence appeared in the public database as the full-length gene encoding the 
heme-regulated initiation factor 2-alpha kinase (HRI) (AF1 81071). 

Human AA088547 (SEQ ID NO:78, SEQ ID NO: 1 97) 
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Genewise and Genescan analyses of genomic databases revealed an extension to 
encompass the complete ORF for AA088547. 

Human orthologue of AA139478jn (SEQ ID NO:80, SEQ ID NO:199) ■ 
Tblastn identified the lncyte 21 1475.1 as the potential full-length human 
5 orthologue of murine AA 139478 

Human AA232253 (SEQ ID NO:82, SEQ ID NO:201) 

The full-length ORF for AA232253 was isolated by screening a cDNA library 
using a probe derived from AA232253. Since the initial filing of this application, the 
AA232253 sequence appeared in the public database as the full-length gene encoding SLK 
10 (AB01 1422). SLK is a stress-regulated mixed lineage kinase-like protein that activation 
of Rac and induction of apoptosis. cDNA library screening revealed multiple isoforms for 
this gene which are summarized in the Table below. 

Table 9. Isoforms for AA232253 

15 



Kestrl 
Name 


Kestrl AA 
Acc# 


Isoform 
type 


Description* 


MLK4 


AA232253 


MLK4 


Substitution of C for W at 346 






MLK4B 


Different Cterm (332-800); seq in MLK4B is as 
shown in * 



* C-terminus specific to MLK4B 

LPLAARMSEESYFESKTEESNSAEMSCQITATSNGEGHGMNPSLQAMMLMGFGD1 
FSMNKAGAVMHSGMQJNMQAKQNSS 
20 KTTSKRRGKKVNMALGFSDFDLSEGDDDDDDDGEEEDNDMDNSE 

Human H97685 (SEQ ID NO:84, SEQ ID NO:203) 

Blastn analysis revealed an extension to encompass the full-length ORF for 
H97685. The full ORF was reconstructed from lncyte 474824.1 and the public ESTs 
25 H97685 andM62021. 

Human A1052250 (SEQ ID NO:87, SEQ ID NO:206) 
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Blastn analysis revealed an extension to encompass the full-length ORF for 
A1052250. The fiill ORF was reconstructed from Incyte 396868.1, the public partial 
cDNA FLJ 10074 (minus intron) and the public ESTs and the public ESTs AI052250 and 
H97685, A1499220 and M62021. 
5 Human AA278842 (SEQ ID NO:88, SEQ ID NO:206) 

A nearly full-length cDNA (FL4F12) for AA278842 was isolated by screening a 
cDNA library using a probe derived from AA278842. A full-length virtual ORF was 
generated using FL4F12 and AA278842. 

Human AA599286 (SEQ ID NO:89, SEQ ID NO:208) 
10 Since the initial filing of this application, the partial AA599286 sequence appeared 

in the public database as a full-length ORF (AK000342). 

Human AA425725 (SEQ ID NO:90, SEQ ID NO:209) 

Since the initial filing of this application, the partial AA425725 sequence appeared 
in the public database as MSSK1, a serine kinase gene located from human chromosome 
15 Xq28. 

Human SGK022 orthologue of AA060026_m (SEQ ID NO:91, SEQ ID NO:210) 

Tblastn, Smith-Waterman and blastn analyses of cDNA and genomic databases 
databases revealed a potential human orthologue for murine AA060026. The full-length 
ORF for SGK022 was reconstructed from genomic locus AC022307. 
20 Human AA399669 (SEQ ED NO:93, SEQ ID NO:212) 

Blastn analysis revealed an extension to encompass the full-length ORF for 
AA399669. The full ORF was reconstructed as follows: sequence 1-1007 from 
AL136295.2; sequencel 008-23 19 from AA399669 and Incyte 428177.1. 

Human AA883975 (SEQ ID NO:95, SEQ ID NO:214) 
25 Genescan and Genewise analyses of the genomic databases revealed an extension 

for AA883975 to encompass the full-length ORF 

Human AA905446 (SEQ ID NO:96, SEQ ID NO:215) 

Tblastn, Smith-Waterman and blastn analyses of cDNA and genomic databases 
databases revealed an extension for AA905446 to encompass the full-length ORF. For the 
30 Smith-Waterman analysis murine STK22 ( NP_033462) was used as the closest 

orthologue. Contig formation: range 162133-163687 from HGP__h 6921333_9; removed 
intron (1 46-893) predicted from blastx analysis. 
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Human H29974 (SEQ ID NO: 97 SEQ ED NO:216) 

Blastn analysis revealed an extension to encompass a complete catalytic ORF for 
AA399669. The nearly full-length ORF was reconstructed using Incyte 213829.1 and 
H29974. 

5 Human AA215311 (SEQ ID NO:99, SEQ ID NO:218) 

Blastn analysis revealed an extension to encompass the full-length ORF for 
AA21531 . The full ORF was reconstructed from Incyte 067584.1, 022456.1, AA2 1531 1 
and the reverse complement of CPG_043208. 

Human AA018361 (SEQ ID NO: 100, SEQ ID NO:219) 
10 The full-length ORF for AA018361 was isolated by screening a cDNA library 

using a probe derived from AA01 8361 . This yielded clone Sug4-30. Clone Sug4-30, like 
multiple, independent cDNA clones contained a 181bp intron. The existence of intron-less 
RNA's was confirmed by a PCR reaction that generated a product that upon sequence 
analysis skipped the intron region. The full-length virtual. ORF for AA018361 was 
15 generated through a contig between AL1 17482 (seq 1-367) and the sequence for clone 

Sug4-30. 

Human orthologue of AA396601_m (SEQ ID NO:106 5 SEQ ED NO:225) 
tBlastn and Smith-Waterman analyses of genomic sequence revealed an extension 

to encompass the full catalytic region for the human orthologue of AA396601 . The ORF 
20 was reconstructed from Incyte 018653.9 (7261449H1, 6891740J1) and genomic sequence 

CPG_040010. 

Human orthologue of AA671275_m (SEQ ID NO:108, SEQ ID NO:227) 
Since the initial filing of this application, a potential human orthologue for murine 
AA671275 appeared in the public database as the full-length ORF for vaccinia related 
25 kinase 3 (BAA90769). 

Human H05721 (SEQ ID NO:l 1 1, SEQ ID NO:230) 

Genescan and Genewise analyses of genomic sequence revealed an extension to 
encompass the full-length ORF for H05721 . 

Human AI086865 (SEQ ID NO:l 12, SEQ ID NO:231) 
30 Genescan and Genewise analyses of genomic sequence revealed an extension to 

encompass the full-length ORF for AI086865. The full-length ORF was reconstructed 
from Celera 17000102901516, Incyte 243269.1 and public AL1 377531 . 
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Human AA836348 (SEQ ID NO: 11 3, SEQ ID NO:232) 

Genescan and Genewise analyses of genomic sequence revealed an extension to 
encompass the full-length ORF for AA836348. 

Human R86668 (SEQ ID NO: 14, SEQ ID NO:233) 

The full-length ORF for R86668 was isolated by screening a cDNA library using a 
probe derived from R86668. Since the initial filing of this application, the R8668 sequence 
appeared in the public database as the full-length gene mitogen-activated protein kinase 
kinase kinase 6 (MAP3K6) (NM_00467). 

Human 2R4 1-9-4 (SEQ ID NO: 16, SEQ ID N0.235) 

The fiill-length virtual ORF for 2R41-9-4 was generated using genomic sequence 
to provide the Nterminus for the partial ORF predicted from clone 2R41-9-4 

Table 10. Sequences deleted from the provisional patent due to duplication with other 
genes in the patent 



Prov. SEQ ID NO: (na) 


Prov. SEQ ID NO: (aa) 


160 


196 


213 


214 


215 


216 


122 


126 


119 


123 


148 


184 


4 


20 


7 


23 


205 


206 


14 


30 


15 


31 


35 


56 


42 


63 


51 


72 


44 


65 


77 


91 
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78 


92 


79 


93 


80 


94 


157 


193 



Results 

Table 1 documents the results from the analysis of the nucleic acid sequence data. 
From left to right the data presented is as follows. "Gene name" refers to the EST or PCR 
5 fragment that defined the novel kinase. "Species" refers to the organism the sequence was 

derived from. "DD#" refers to the nucleic acid and amino acid sequence ED number 
designation from this patent. "Kinase family "and "Kinase group" refers to the protein 
kinase classification defined by sequence homology and based on previously established 
phylogenetic analysis [Hardie, G. and Hanks S. The Protein Kinase Book, Academic Press 

10 (1995) and Hunter T. and Plowman, G. Trends in Biochemical Sciences (1977) 22:18-22 

and Plowman G.D. et al (1999) Proc. Natl. Acad. Sci. 96:13603-13610)]. "ORF Start", 
"ORF End", "ORF Length" refer to the open reading frame range and length as calculated 
by standard nucleic acid translation programs such as MapDraw (DNAStar). "DNA 
Repeats" refers to regions of low complexity sequence or repetitive elements such as Alu, 

15 LINE, SINE, and LTR sequences. The chromosomal location (CHR localization) for 37 

of the 1 10 novel protein kinases is shown on Table 1 (NA, not available). The methods 
for determining chromosomal position are outlined below, in Example 2. 

Table 2 documents the results from the analysis of the amino acid sequence data. 
From left to right the data presented is as follows. "Gene name" refers to the EST or PCR 

20 fragment that defined the novel kinase. "Species" refers to the organism the sequence was 

derived from. "ID#" refers to the nucleic acid and amino acid sequence ID number 
designation from this patent. "Kinase family "and "Kinase group" refers to the protein 
kinase classification defined by sequence homology and based on previously established 
phylogenetic analysis [Hardie, G. and Hanks S. The Protein Kinase Book, Academic Press 

25 (1995) and Hunter T. and Plowman, G. Trends in Biochemical Sciences (1977) 22: 1 8-22 

and Plowman G.D. et al (1999) Proc. Natl. Acad. Sci. 96:13603-13610)]. "nraa Score", 
"ID match aa", "Identity", "Similar", "nraa Match Acc#" Description" refer to the data 
obtained using a Smith-Waterman search of the amino acid sequence against the non- 
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redundant protein database (Matrix: PamlOO; gap open/extension penalties 14/1). "Kinase 
Domain Start" "Kinase Domain End", "Profile Start" and "Profile End" refer to data 
obtained using a Hidden-Markov Model to define catalytic range boundaries. The profile 
has a length of 261 amino acids, corresponding to the complete protein kinase catalytic 
5 domain. Proteins in which the profile recognizes a full length catalytic domain have a 
"Profile Start" of 1 and a "Profile End" of 261 . The boundaries of the catalytic domain 
within the overall protein are noted in the "Kinase Domain Start" and "Kinase Domain 
End" columns. 

10 The following abbreviations were used for kinases: 



ASK 


Apoptosis signal-regulating kinase 


CaMK 


Ca2+/calmodulin-dependent protein kinase 


CCRX 


Cell cycle-related kinase 


CDK 


Cyclin-dependent kinase 


CK 


Casein kinase 


DAPK 


Death-associated protein kinase 


DM 


myotonic dystrophy kinase 


Dyrk 


dual-specificity-tyrosine phosphorylating-regulated kinase 


GAK 


Cyclin G-associated kinase 


GRK 


G-protein coupled receptor 


GuC 


Guanylate cyclase 


HffK 


Homeodomain-interacting protein 


IRAK 


lnterleukin-1 receptor-associated kin 


MAPK 


Mitogen activated protein kinase 


MAST 


Micotubule-associated STK 


MLCK 


Myosin-light chain kinase 


MLK 


Mixed lineage kinase 


NIMA 


NimA-related protein kinase 


PKA 


cAMP-dependent protein kinase 


RSK 


Ribosomal protein S6 kinase 


RTK 


Receptor tyrosine kinase 
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SGK Serum and glucocorticoid-regulated kinase 
STK serine threonine kinase 
ULK UNC-5 1 -like kinase 

The following abbreviations were used for species 



H 


Human 


M 


Murine 


R 


Rat 


FV 


Fowlpox virus 


MT 


M. thermoautotrophicum 


CE 


Caenorhabditis elegans 


DM 


Drosophila melanogaster 


OS 


Oryza sativa 


SP 


Schizosaccharomyces pombe 


TP 


Tetrahymena pyriformis 


PI 


Petunia inflata 


NC 


Neurospora crassa 


MSV 


Medicago sativa 


MSV 


Moloney murine sarcoma virus 


SA 


Squalus acanthias 


cs 


Cucumis sativus 


GM 


Glycine max 


LL 


Lilium longiflorum 


TV 


Trichomonas vaginalis 


MP 


Mycoplasma pneumoniae 


DD 


Dictyostelium discoideum 


SC 


Saccharomyces cerevisiae 


MT 


Methanobacterium thermoautotrophicum 
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Domain and Motif Identification 

A Hidden Markov model (HMM) (Krogh, A., Brown, M., Mian, I. S., Sjolander, 

K., and Haussler, D. (1994). Hidden Markov models in computational biology: 

Applications to protein modeling. J. Mol. Biol., 235:1501-1531) was used to identify, 
5 both catalytic and extracatalytic domains. Table 4 shows extra-catalytic domains that 

were identified using the HMM program. Other domains such as coiled-coil and pest 

motifs were identified as described next. 

Potential coiled-coil domains were identified using the COILS program 

(www.ch.embnet.org/software/C01LS_form.html). The matrix used was MTIDK with 
10 windows of 14, 21, 28 amino acids. Only regions scoring 0.5 or higher were considered to 

have potential coiled-coil domain region. 

Protein sequences containing potential pest motifs were identified using the 

program PESTfind (www.at.embnet.org/embnet/tools/bio/PESTfind/). PEST regions in 

proteins are by definition sequences that tend to be rich in proline, glutamic or aspartic 
1 5 acid, argininine and histidine; they have been associated with increased protein turnover 

rates (Rogers S. et al (1986) Science 234, 364-368. The algorithm defines PEST 

sequences as hydrophilic stretches of amino acids greater than or equal to 12 residues in 

length. Such regions contain at least one P, one E or D and one S or T. They are flanked 

by lysine (K), arginine (R) or histidine (H) residues, but positively charged residues are 
20 disallowed within the PEST sequence. PESTfind produces a score ranging form about -50 

to +50. By definition, a score above zero denotes a possible PEST region; a value greater 

than +5 defines a high probability that there is a PEST domain. 

Identification of potential coiled-coil domains and PEST domains in N34132 

Potential coiled-coil domains were identified in N34132 (SEQ ID NO: 183) using 
25 the COILS program. Only regions scoring 0.5 or higher were considered to have potential 

coiled-coil domain region. The amino acid positions within N34231 scoring for potential 

coil-coil regions are shown below. 
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Table 1 1 coiled-coil domains predicted for N34] 32 



Coiled-coil Region 


Amino acid range 


Length (aa) 


1 


124-147 


24 


2 


437-451 


15 


3 


495-526 


32 


4 


1,723-1,749 


27 



Potential PEST domains were identified in N34132 using PESTfind, a value 
5 greater than +5 defines a high probability that there is a PEST domain. The amino acid 

positions within N34132 scoring for potential PEST regions are shown below. 



Table 12 Potential Pest domains identified in N34132 



PEST Region 


Score 


Amino acid range 


Amino Acid Length 


1 


+ 4.91 


54-95 


42 


2 


+11.4 


537-570 


34 


3 


+31.08 


1293-1304 


12 


4 


+10.15 


1543-1565 


23 


5 


+ 6.17 


1698-1732 


35 



10 EXAMPLE 2. Chromosomal Localization of Novel Mammalian Protein Kinases 

Materials and Methods 

Several sources were used lo find information about the chromosomal localization 

of each of the genes described in this patent. First, the accession number for the nucleic 

acid sequence was used to query the Unigene database. The site containing the Unigene 
15 search engine is: http://www. ncbi.nlm.nih.gov/UniGene/Hs.Home.html. Information on 

map position within the Unigene database is imported from several sources, including the 

Online Mendelian Inheritance in Man (OMIM, 

http://www.ncbi.nlm.nih.gov/Omim/searchomim.html), The Genome Database 
(http://gdb.infobiogen.fr/gdb/simpleSearch.html), and the Whitehead Institute human 
20 physical map (http://carbon.wi.mit.edu:8000/cgi-bin/contig/sts_info?database=release). 
For example, searching Unigene with W56561, an EST for a MAK-like kinase, the 
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following information is retrieved: Chr.14, D14S65-qTEL. The location of this gene on 
an "ideogram" of the cytogenetic map of chromosome 14 is also provided, showing that 
W56561 maps to the bottom of chromosome 14, between 14q31 and 14qTel. If Unigene 
has not mapped the EST, then the nucleic acid for the gene of interest is used as a query 
5 against databases, such as dbsts and htgs (described at 

http://www.ncbi.nlm.nih.gov/BLAST/blast_databases.html) containing sequences that 
have been mapped already. The nucleic acid sequence is searched using BLAST-2 at 
NCB1 (http://www.ncbi.nlm.nih.gov/cgi-bin/BLAST/nph-newblast) and is used to query 
either dbsts or htgs. In addition to the Whitehead and GDB sites mentioned above, 

10 Stanford University maintains a useful site for chromosomal mapping from STS data 

(http://www-shgc. Stanford. edu/RH/rhserverfommew.html). Matches in htgs are often 
resolved immediately because the genomic region hit is annotated in the htgs entry. If an 
exact match match is found (defined roughly as 99% identity over a region of about 100 
base pairs or longer, excluding any repetitive sequence), then the mapped position of the 

1 5 entry in the database is assigned to the original kinase query. Once a cytogenetic region 

has been identified by one of these approaches, disease association is established by 
searching OMIM (see above for URL) with the cytogenetic location. OMIM maintains a 
searchable catalog of cytogenetic map locations organized by disease. A thorough search 
of available literature for the cytogenetic region is alo made using Medline 

20 (http://www.ncbi.nlm.nih.gov/PubMed/medline.html). References for association of the 

mapped sites with chromosomal abnormalities found in human cancer can be found in: 
Knuutila, et a/., Am J Pathol, 1998, 152:1107-1123. 

Results 

25 The chromosomal location for 37 of the 1 10 novel protein kinases is shown on 

Table 1 . Three of the novel protein kinases were mapped to regions associated with 
cancer amplicons, as shown on this table. The regions were also cross-checked with the 
Mendelian Inheritance in Man database, which tracks genetic information for many human 
diseases, including cancer. References for association of the mapped sites with 

30 chromosomal abnormalities found in human cancer can be found in: Knuutila, et al., Am J 
Pathol, 1998, 152:1 107-1 123. Association of these mapped regions with other diseases is 
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documented in the Online Mendelian Inheritance in Man (OMIM) 
(http://www.ncbi.nlm.nih.gov/htbin-post/Omim). 

EXAMPLE 3: Generation of Specific Immunoreagents 

5 Materials and Methods 

Peptide sequences to extra-catalytic regions of novel kinases are chosen which are 
not homologous to other known kinases based on a Smith Waterman homology search 
against the non-redundant protein database and predicted to be antigenic based on the 
DNAStar Protean program. These peptides are conjugated to KLH using Glutaraldehyde. 

10 Rabbits are immunized with the KLH-peptide conjugates by four injections three 

weeks apart. The rabbits are bled ten and fourteen days following the third injection and 
bled out ten days after the fourth. The serum is checked against the peptide by EL1SA. 



Table 13. Peptides to be used as immunogens for raising antibodies 



Clone 
Name 


SEQ1D 
NO (aa) 


Peptide Sequence 


Amino Location 


AA8256850 


124 


KSRDNSRDSSQSEND 


339-353 






TEKLKRSQDLPREPLP 


372-386 






RGWPvPYDlHS 


223-232 


5R79-46-1 


126 


FEGPRRNKEVMYK 


224-236 






KDDYNETVHKKTE 


451-463 






GTHPKDRNVEKLQ 


541-553 






EV SKYQEYTNELQET 


643-657 


AA256100 


129 


IDDTSNFDDFPESD1 


405-419 






TEPDYKSKDWVFL 


427-439 






EEKKLRRSQHARKET 


61-75 


AA210825 


130 


SNKDTLRKRHYWRLD 


507-521 






RHTTRKSSTTLRE 


488-500 






FQNNTTNRYYKEJPL 


528-542 






GKHRKTGRDVAVK 


668-680 






FPTKQESQLRNE 


687-698 
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AA3 16804 


132 


ESHVHQEPSKRffS 


239-252 






HTKRKSSTMVKEGW 


409-422 






PSDLDVERDEEAVK 


375-388 






SPGQGKDHKDLSTSI 


543-557 


R47805 


143 


EPVGRWDQDYDRAVL 


44-58 






KPKGPGGKRGHKRL1 


325-339 






PTDVAQLPSRVPRDA 


219-233 


AA234451 


167 


DPFDWEKTGNDGSLT 


293-307 






HPRPQEKDVWEE 


374-385 






RENTDEVFPDEQLSD 


340-354 






RSEITQPDRDIPLVR 


427-441 


AA460132 


180 


LKSYSTSSKKARPVL 


222-236 






KKLDEVRLRGRKRSM 


237-251 






ETEKTAQGLSNLAKT 


131-145 


N34132 


183 


SGRRRRPTKSKGSKS 


1848-1862 






PGTAPSKPPLTKAPV 


1474-1488 






VDSDTQPKAPGIDD 


1365-1378 






AHSLDKTSHSSTTGL 


1253-1267 


5R69-17-2 


187 


GTTREKTDRVKST 


178-190 






HSEAPELHGKJRSSN 


138-152 






DETVTPPQFSIV 


87-98 






QYDVKSEIYS 


204-213 


AA278842 


206 


TVDPEKSVRDQAFKA 


515-529 






DSSTADRWDDEDWGS 


637-651 






SVSEDPTQLEEVEKD 


539-553 


AA836348 


232 
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EXAMPLE 4. Expression analysis of Novel Mammalian Protein Kinases 
GENE EXPRESSION ANALYSIS 
Tissue Arrays 

"cDNA libraries" derived from a variety of sources were immobilized onto nylon 
membranes and probed with 32P-labeled cDNA fragments derived from the gene(s) of 
interest. 

Total RNA or mRNA was used as template in a reverse transcription reaction to 
generate single-stranded cDNAs (ss cDNA) that were tagged with specific sequences at 
each end. An oligo dT primer containing a specific sequence (CDS: 
AAGCAGTGGTAACAACGCAGAGTACT30VN (V=A,G,C N=A,G,C,T)) anneals at 
the polyA track at the 3' end of the mRNA and the reverse transcriptase (MMLV 
RnaseH-) transcribes the antisense strand until it reaches the end of the RNA strand when 
it adds additional C residues. If a primer (SMII: 
AAGCAGTGGTAACAACGCAGAGTACGCGGG or ML2G: 

AAGTGGCAACAGAGATAACGCGTACGCGGG) ending with 3 Gs is added, it anneals 
to the added Cs and the MMLV recognizes the rest of the primer sequence as template and 
continues transcription. As a result, the synthesized cDNAs contain specific sequence tags 
at both the 5' and the 3 1 end. When the 5' and the 3' ends are tagged with the same 
sequence (CDS and SMII) it is referred to as "symmetric " When the 5' end is tagged 
with a different sequence than the 3' end (CDS and ML2G) is referred to as "asymmetric" 
A double-stranded "cDNA library 41 is then generated by PCR amplification using the 
3'PCR and ML2 primers (3' PCR: AAGCAGTGGTAACAACGCAGAGT and ML2: 
AAGTGGCAACAGAGATAACGCGT) that anneal to the added sequence tags. 

The amplified "cDNA libraries" were manually arrayed onto nylon membranes 
with a 384 pin replicator. The DNA was denatured by alkali treatment, neutralized and 
cross-linked by UV light. The arrays were pre-hybridized with Express Hyb (Clontech) 
and hybridized with 32P labeled probes generated by random hexamer priming of cDNA 
fragments corresponding to the genes of interest. After washing, the blots were exposed to 
phosphorimaging cassettes and the intensity of the signal was quantified. The amount of 
the DNA on the arrays was also quantified by treating non-denatured or denatured arrays 
with Syber Green I or Syber Green II respectively (1 : 100,000 in 50mM Tris, pH8.0) for 2 
minutes. After washing with 50mM Tris, pH8.0, the fluorescent emission was detected 



WO 00/73469 



PCI7US00/14842 



with a phosphorimager (Molecular Dynamics) and quantified. The amount of the arrayed 
DNA was used to normalize the hybridization signal and the corrected values are tabulated 
in Table 3. 

5 Results 

The results of the microarray expression analysis of the protein kinases presented 
in this application is shown in Table 3. Data presentation from left to right is as follows: 
"Tissue": tissue type of the cDNA; "Tumor sym", indicates that the tissue is derived from 
a tumor, "sym" refers to the fact that the 5' and 3' primers used to make the sample are the 

10 same; "Normal Sym", indicates normal tissue was used to make the sample, with 

symmetric primers as described above; "Tumor lo", indicates that primary tumor tissue 
was used to make the cDNA; "Tumor cells", indicates that these cDNA samples were 
made from cultured tumor cells; "Normal", indicates that these samples are derived from 
normal tissue or cell lines; "Endos", indicates that these samples are derived from 

1 5 endothelium-related tissue sources; "p53" refers to the status, mutant or wild-type, of the 

p53 gene in the source samples. Normalized expression values are presented for each 
gene referred to by its SEQ ED# on the subsequent columns. Genes represented in 
expression Table 3 are: SEQ ID NO:3 (AA826850), SEQ ED NO:5 (TBK1), SEQ ID NO:6 
(AA3051 76), SEQ ED NO:8 (AA256100), SEQ ED NO:9 (CAB43292), SEQ ED NO: 1 1 

20 (EPK2), SEQ ID NO: 1 2 (PKNbeta), SEQ ED NO:14 (H19102), SEQ ID NO: 16 (RSK4), 
SEQ ED NO: 17 (AAD30182), SEQ ID NO:20 (SGK2), SEQ ID NO:22 (PTK9L), SEQ ID 
NO:26 (AA383293), SEQ ED NO:29 (DRAK2), SEQ ID NO:31 (DRAX1), SEQ ID 
NO:032 (AA015726), SEQ ED NO:40 (MAK-V), SEQ ED NO:044 (TRAD), SEQ ID 
NO:044 (TRAD), SEQ ED NO:45 (AA454060), SEQ ID NO:47 (AA234451), SEQ ID 

25 NO:48 (AA436054), SEQ ID NO:49 (AA626859), SEQ ID NO:5 1 (Kl AA0904), SEQ ID 
NO:52 (AA789239), SEQ ID NO:54 (CCRK), SEQ ID NO:55 (CLK4), SEQ ID NO:56 
(AA557536), SEQ ID NO:57 (W56561), SEQ ED NO:60 (AA579641), SEQ ID NO:63 
(NEK7), SEQ ID NO:66 (CAMKKB), SEQ ED NO:68 (HEPK2), SEQ ID NO:72 
(R19609), SEQ ED NO:73 (HRT), SEQ ID NO:78 (AA088547), SEQ ED NO:79 

30 (AA449542), SEQ ED NO:082a (MLK4), SEQ ED NO:82 (MLK4b), SEQ ID NO:84 
(REP4), SEQ ID NO:88 (AA278842), SEQ ED NO:89 (AA 195964), SEQ ID NO:90 
(MSSK1), SEQ ED NO:93 (TSK4), SEQ ID NO:94 (AI025291), SEQ ED NO:95 
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(AA948538), SEQ ID NO:96 (AA905446), SEQ ID NO:97 (H85389), SEQ ID NO: 100 
(AA018361), SEQ ID NO:101 (AA31 1714), SEQ ID NO:l 10 (AA452647), SEQ ID 
NO:l 1 1 (AA310219), SEQ ID NO: 1 12 (Al 086865), SEQ ID NO:l 14 (MEKX6), and SEQ 
IDNO:ll6(SuRTK106). 

5 

EXAMPLE 5. Kinase assays for Erk, JNK1 and p38 MAP kinases 

293T cells were transiently transfected with HA- p38 or co-transfected with Flag- 
tagged wt MLK4A, kinase-dead MLK4A, wild-type MLK4B or kinase-dead MLK4B 
using Lipofectamine 2000 (Lifetech). Cells were lysed 36 hr post-transfection. Cell 

10 lysates normalized to contain equivalent amounts of HA-p38 were immunoprecipitated 

with anti-HA antibody (Mab HA-1 1, Babco). Immunoprecipitates were split in two 
portions, one portion was Western-blotted with anti- HA antibody and the other with a 
phospho-specific p38 antibody (Promega) to detect activated levels of p38. Activation of 
Erkl and Jnkl was measured similarly. (This example applies to AA232253 (SEQ ID 

15 NO:82,SEQIDNO:201).) 

Results : 

In transient assays wild-type MLK4A and MLK4B (but not kinase-inactive 
MLK4A(K45M) or MLK4BQC45M)) activate Erk, JNK1 and p38 MAP kinases. 
20 EXAMPLE 6. RAO guanine-exchange factor assay 

293T cells were transiently transfected with HA-Racl or co-transfected with Flag- 
tagged Duet C, Duet E, Dbl and HA-Tiam-1 . Cells were lysed 36 hour post-transfection. 
Cell lysates normalized to contain equivalent amounts of Racl were affinity precipitated 
with immobilized GST-PBD (p21 -binding domain of Pak3). Bound proteins were 
25 Western blotted and probed with anti-HA antibody to detect levels of activated Racl . 

((This example applies to R199772 (Trad/Duet)(SEQ ID NO:44, SEQ ID NO:164).) 



Results : 

Duet C and Duet E both act as guanine nucleotide exchange factors on Racl . 
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CONCLUSION 

One skilled in the art would readily appreciate that the present invention is well 
adapted to carry out the objects and obtain the ends and advantages mentioned, as well as 
5 those inherent therein. The molecular complexes and the methods, procedures, treatments, 

molecules, specific compounds described herein are presently representative of preferred 
embodiments are exemplary and are not intended as limitations on the scope of the 
invention. Changes therein and other uses will occur to those skilled in the art which are 
encompassed within the spirit of the invention are defined by the scope of the claims. 

1 0 It will be readily apparent to one skilled in the art that varying substitutions and 

modifications may be made to the invention disclosed herein without departing from the 
scope and spirit of the invention. 

All patents and publications mentioned in the specification are indicative of the 
levels of those skilled in the art to which the invention pertains. 

15 The invention illustratively described herein suitably may be practiced in the 

absence of any element or elements, limitation or limitations which is not specifically 
disclosed herein. Thus, for example, in each instance herein any of the terms 
"comprising", "consisting essentially of and "consisting of 5 may be replaced with either 
of the other two terms. The terms and expressions which have been employed are used as 

20 terms of description and not of limitation, and there is no intention that in the use of such 

terms and expressions of excluding any equivalents of the features shown and described or 
portions thereof, but it is recognized that various modifications are possible within the 
scope of the invention claimed. 

In particular, although some formulations described herein have been identified by 

25 the excipients added to the formulations, the invention is meant to also cover the final 

formulation formed by the combination of these excipients. Specifically, the invention 
includes formulations in which one to all of the added excipients undergo a reaction 
during formulation and are no longer present in the final formulation, or are present in 
modified forms. 

30 In addition, where features or aspects of the invention are described in terms of 

Markush groups, those skilled in the art will recognize that the invention is also thereby 
described in tenns of any individual member or subgroup of members of the Markush 
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group. For example, if X is described as selected from the group consisting of bromine, 
chlorine, and iodine, claims for X being bromine and claims for X being bromine and 
chlorine are fully described. 

Other embodiments are within the following claims. 
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What is claimed is: 

CLAIMS 

1 . An isolated, enriched, or purified nucleic acid molecule encoding a kinase 
polypeptide selected from the group consisting of SEQ ID NO: 122, SEQ ID NO: 123, SEQ 
5 ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ 
ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ 
ID NO: 1 34, SEQ ED NO: 1 35, SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: 1 38, SEQ 
ID NO: 1 39, SEQ ID NO: 1 40, SEQ ID NO: 1 4 1 , SEQ ID NO: 1 42, SEQ ID NO: 1 43, SEQ 
ED NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ 

10 ID NO: 149, SEQ ID NO:150, SEQ ED NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ 
ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ 
ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ 
ID NO: 1 64, SEQ ED NO: 1 65 . SEQ ED NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ 
ID NO: 169, SEQ ED NO: 1 70, SEQ ED NO: 1 71 , SEQ ED NO: 1 72, SEQ ID NO: 1 73, SEQ 

15 ID NO: 1 74, SEQ ED NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ 

ID NO: 179, SEQ ID NO: 180, SEQ ID NO: 181, SEQ ED NO: 182, SEQ ID NO: 183, SEQ 
ID NO: 1 84, SEQ ED NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ 
ED NO: 1 89, SEQ ED NO: 1 90, SEQ ED NO: 1 9 1 , SEQ ID NO: 1 99, SEQ ID NO: 1 93, SEQ 
ID NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ 

20 ED NO: 199, SEQ ED NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ 

ED NO:204, SEQ ED NO:205, SEQ ED NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ 
ID NO:209, SEQ ID NO:210, SEQ ED NO:21 1, SEQ ED NO:212, SEQ ID NO:213, SEQ 
ID NO:214, SEQ ID NO:21 5, SEQ ID NO:216, SEQ ID NO:21 7, SEQ ID NO:218, SEQ 
ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ 

25 ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ 

ID NO:229, SEQ ID NO:230, SEQ ID NO:231 , SEQ ID NO:232, SEQ ED NO:233, SEQ 
ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ 
ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ ID NO:242. 
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2. The nucleic acid molecule of claim 1, wherein said nucleic acid molecule 
comprises a nucleotide sequence that: 

(a) encodes a polypeptide comprising the amino acid sequence set forth 
in SEQ ID NO: 122, SEQ ID NO: 123, SEQ ED NO: 124, SEQ ID NO: 125, SEQ ID 
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(b) is the complement of the nucleotide sequence of (a); 

(c) hybridizes under highly stringent conditions to the nucleotide 
molecule of (a) and encodes a naturally occurring kinase polypeptide; 
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(d) encodes a kinase polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, 
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but not all, of a domain selected from the group consisting of an N-terminal domain, a 
catalytic domain, a C-terminal domain, a coiled-coil structure region, a proline-rich region, 
a spacer region, an insert, and a C-terminal tail; 
30 (e) is the complement of the nucleotide sequence of (d); 
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(f) encodes a domain of an amino acid sequence selected from the 
group set forth in SEQ ED NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, 
SEQ ED NO: 126, SEQ ED NO:127, SEQ ED NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, 
SEQ ED NO: 131, SEQ ED NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, 

5 SEQ ED NO: 136, SEQ ED NO:137, SEQ ED NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, 
SEQ ED NO: 141, SEQ ED NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, 
SEQ ED NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ED NO:149, SEQ ID NO:150, 
SEQ ED NO:151, SEQ ID NO:152, SEQ ED NO: 153, SEQ ID NO:l 54, SEQ ID NO:l 55, 
SEQ ED NO: 156, SEQ ED NO:157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, 

10 SEQ ED NO: 161, SEQ ED NO: 162, SEQ ED NO: 163, SEQ ED NO: 164, SEQ ED NO: 165. 
SEQ ED NO: 166, SEQ ED NO:167, SEQ ED NO: 168, SEQ ED NO: 169, SEQ ED NO: 170, 
SEQ ID NO: 1 7 1 , SEQ ED NO: 1 72, SEQ ED NO: 1 73, SEQ ED NO: 1 74, SEQ ED NO: 1 75, 
SEQ ED NO: 1 76, SEQ ED NO: 1 77, SEQ ED NO: 1 78, SEQ ED NO: 1 79, SEQ ED NO: 1 80, 
SEQ ED NO: 1 8 1 , SEQ ED NO: 1 82, SEQ ED NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, 

15 SEQ ED NO: 186, SEQ ED NO: 187, SEQ ID NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, 
SEQ ED NO: 191, SEQ ED NO: 199, SEQ ED NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, 
SEQ ED NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, 
SEQ ED NO:201 , SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, 
SEQ ID NO:206, SEQ ED NO:207, SEQ ID NO:208, SEQ ED NO:209, SEQ ED NO:210, 

20 SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, 
SEQ ED NO:216, SEQ ED NO:217, SEQ ED NO:218, SEQ ID NO:219, SEQ ID NO:220, 
SEQ ID NO:221 , SEQ ED NO:222, SEQ ED NO:223, SEQ ED NO:224, SEQ ED NO:225, 
SEQ ID NO:226, SEQ ED NO:227, SEQ ED NO:228, SEQ ED NO:229, SEQ ID NO:230, 
SEQ ID NO:231, SEQ ED NO:232, SEQ ED N0.233, SEQ ED NO:234, SEQ ID NO:235, 

25 SEQ ED NO:236, SEQ ED NO:237, SEQ ED NO:238, SEQ ED NO:239, SEQ ED NO:240, 

SEQ ID NO:241, and SEQ ID NO:242, wherein said domain is selected from the group 
consisting of an N-terminal domain, a catalytic domain, a C-terminal domain, a coiled-coil 
structure region, a proline-rich region, a spacer region, an insert, and a C-terminal tail; 

(g) is the complement of the nucleotide sequence of (f); 

30 (h) encodes a polypeptide comprising an amino acid sequence selected 

from the group consisting of SEQ ID NO: 122, SEQ ED NO: 123, SEQ ID NO: 124, SEQ ID 
NO:125, SEQ ED NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID 
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NO:130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID 
NO:135, SEQ ED NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID 
NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID 
NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID 

5 NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID 

NO: 155, SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID NO: 158, SEQ ID NO: 1 59, SEQ ID 
NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID 
NO:165. SEQ ID NO:166, SEQ ID N0:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID 
NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID 

10 NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 177, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID 
NO: 1 80, SEQ ID NO: 1 81 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID 
NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID 
NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID 
NO: 1 95, SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, SEQ ID 

1 5 NO:200, SEQ ID NO:201 , SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID 
NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID 
NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID 
NO:215, SEQ ID NO:216, SEQ ID NO:21 7, SEQ ID NO:21 8, SEQ ID NO:219, SEQ ID 
NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID 

20 NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID 

NO:230, SEQ ID NO:231 , SEQ ID NO:232, SEQ ID NO:233, SEQ ID N0.234, SEQ ID 
NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID 
NO:240, SEQ ID NO:241, and SEQ ID NO:242, except that it lacks one or more, but not 
all, of the domains selected from the group consisting of an N-terminal domain, a catalytic 

25 domain, a C-terminal domain, a spacer region, a proline-rich region, a coiled-coil structure 
region, and a C-terminal tail; or 

(i) is the complement of the nucleotide sequence of(h). 
3. The nucleic acid molecule of claim 1 , further comprising a vector or 
promoter effective to initiate transcription in a host cell. 
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4. The nucleic acid molecule of claim 1 , wherein said nucleic acid molecule is 
isolated, enriched, or purified from a mammal. 

5. The nucleic acid molecule of claim 4, wherein said mammal is a human. 

6. A nucleic acid probe for the detection of nucleic acid encoding a kinase 

5 polypeptide in a sample, wherein said polypeptide is selected from the group consisting of 
SEQ ID NO: 122, SEQ ED NO: 123, SEQ ED NO: 124, SEQ ED NO: 125, SEQ ID NO: 126, 
SEQ ED NO: 1 27, SEQ ED NO: 1 28, SEQ ED NO: 1 29, SEQ ED NO: 1 30, SEQ ID NO: 1 3 1 , 
SEQ ED NO: 132, SEQ ED NO: 133, SEQ ED NO: 134, SEQ ED NO:135, SEQ ED NO: 136, 
SEQ ED NO: 1 37, SEQ ED NO: 138, SEQ ED NO: 1 39, SEQ ED NO: 1 40, SEQ ID NO: 1 4 1 , 

10 SEQ ED NO:142, SEQ ED NO:143, SEQ ED NO:144, SEQ ID NO:145, SEQ ID NO:146,' 

SEQ ED NO: 147, SEQ ED NO: 148, SEQ ED N0.149, SEQ ID NO:150, SEQ ID NO:151, 
SEQ ED NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO:l 54, SEQ ID NO:l 55, SEQ ID NO: 1 56, 
SEQ ED NO: 1 57, SEQ ID NO: 158, SEQ ED NO: 1 59, SEQ ID NO:l 60, SEQ ID NO: 1 61 , 
SEQ ED NO: 162, SEQ ED NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, 

15 SEQ ED NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, 

SEQ ED NO: 172, SEQ ED NO: 173, SEQ ID NO: 174, SEQ ED NO: 175, SEQ ED NO: 176, 
SEQ ED NO: 177, SEQ ED NO: 178, SEQ ID NO: 179, SEQ ID NO: 180, SEQ ED NO: 181, 
SEQ ED NO: 1 82, SEQ ED NO: 1 83, SEQ ID NO: 1 84, SEQ ED NO: 1 85, SEQ ID NO: 1 86, 
SEQ ED NO: 1 87, SEQ ED NO: 1 88, SEQ ED NO: 1 89, SEQ ED NO: 1 90, SEQ ED NO: 1 9 1 , 

20 SEQ ED NO: 199, SEQ ED NO: 193, SEQ ED NO: 194, SEQ ED NO: 195, SEQ ED NO: 196, 
SEQ ED NO: 197, SEQ ED NO:198, SEQ ED NO:199, SEQ ED NO:200, SEQ ED NO:201, 
SEQ ED NO:202, SEQ ED NO:203, SEQ ED NO:204, SEQ ED NO:205, SEQ ED NO:206, 
SEQ ED NO:207, SEQ ED NO:208, SEQ ED NO:209, SEQ ED NO:210, SEQ ED NO:21 1, 
SEQ ID NO:212, SEQ ED NO:213, SEQ ED NO:214, SEQ ED NO:215, SEQ ID NO:216, 

25 SEQ ID NO:217, SEQ ED NO:218, SEQ ED NO:219, SEQ ED NO:220, SEQ ID NO:221, 
SEQ ED NO:222, SEQ ED NO:223, SEQ ED NO:224, SEQ ID NO:225, SEQ ID NO:226, 
SEQ ED NO:227, SEQ ED NO:228, SEQ ED NO:229, SEQ ED NO:230, SEQ ID NO:231 , 
SEQ ED NO:232, SEQ ID NO:233, SEQ ED NO:234, SEQ ED NO:235, SEQ ED NO:236, 
SEQ ID NO:237, SEQ ED NO:238, SEQ ID NO:239, SEQ ED NO:240, SEQ ID NO:241, 

30 and SEQ ED NO:242. 
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7. The probe of claim 6, wherein said polypeptide is a fragment of the protein 
encoded by an amino acid sequence selected from the group consisting of SEQ ED 
NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ED NO:126, SEQ ID 
NO: 1 27, SEQ ED NO: 1 28, SEQ ID NO: 1 29, SEQ ED NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID 
5 NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID 
NO: 137, SEQ ID NO: 138, SEQ ED NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID 
NO: 142, SEQ ED NO: 143, SEQ ID NO: 144, SEQ ED NO: 145, SEQ ID NO: 146, SEQ ID 
NO: 1 47, SEQ ID NO: 1 48, SEQ ED NO: 1 49, SEQ ED NO: 1 50, SEQ ED NO: 1 5 1 , SEQ ID 
NO: 152, SEQ ED NO: 153, SEQ ED NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID 

10 NO: 1 57, SEQ ED NO: 1 58, SEQ ED NO: 1 59, SEQ ED NO: 1 60, SEQ ED NO: 1 6 1 , SEQ ID 
NO: 162, SEQ ED NO: 163, SEQ ED NO: 164, SEQ ED NO: 165. SEQ ID NO: 166, SEQ ID 
NO: 1 67, SEQ ED NO: 1 68, SEQ ED NO: 1 69, SEQ ED NO: 1 70, SEQ ED NO: 1 7 1 , SEQ ID 
NO: 1 72, SEQ ED NO: 1 73, SEQ ID NO: 1 74, SEQ ED NO: 1 75, SEQ ED NO: 1 76, SEQ ID 
NO: 1 77, SEQ ED NO: 1 78, SEQ ED NO: 1 79, SEQ ED NO: 1 80, SEQ ED NO: 1 8 1 , SEQ ED 

15 NO: 1 82, SEQ ED NO: 1 83, SEQ ED NO: 1 84, SEQ ED NO: 1 85, SEQ ED NO: 1 86, SEQ ID 
NO: 1 87, SEQ ED NO: 1 88, SEQ ED NO: 1 89, SEQ ED NO: 1 90, SEQ ID NO: 1 91 , SEQ ID 
NO: 1 99, SEQ ED NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ ID 
NO: 197, SEQ ED NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID 
NO:202, SEQ ED NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 

20 NO:207, SEQ ED NO:208, SEQ ID NO:209, SEQ ID NO:2 1 0, SEQ ID NO:2 1 1 , SEQ ID 
NO:212, SEQ ID NO:213, SEQ ED NO:214, SEQ ED NO:215, SEQ ED NO:216, SEQ ID 
NO:2 1 7, SEQ ED NO:2 1 8, SEQ ID NO:2 1 9, SEQ ID NO:220, SEQ ID NO:22 1 , SEQ ID 
NO:222, SEQ ED NO:223, SEQ ED NO:224, SEQ ED NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ED NO:228, SEQ ED NO:229, SEQ ED NO:230, SEQ ID NO:231, SEQ ID 

25 NO:232, SEQ ID NO:233, SEQ ED NO:234, SEQ ED NO:235, SEQ ED NO:236, SEQ ID 
NO:237, SEQ EDNO:238, SEQ ED NO:239, SEQ ED NO:240, SEQ ID NO:241, and SEQ 
ED NO:242. 
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8. A recombinant cell comprising a nucleic acid molecule encoding a kinase 
polypeptide selected from the group consisting of SEQ ID NO: 122, SEQ ED NO: 123, SEQ 
ID NO:124, SEQ IDNO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ 
ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ 
5 ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ 
ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ 
ID NO: 1 44, SEQ ID NO: 1 45, SEQ ID NO: 1 46, SEQ ID NO: 1 47, SEQ ID NO: 1 48, SEQ 
ID NO: 149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ 
ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID NO: 1 58, SEQ 

10 ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ 
ID NO: 1 64, SEQ ID NO: 1 65. SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ 
ID NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ 
ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ 
ID NO: 1 79, SEQ ID NO:l 80, SEQ ID NO: 1 81 , SEQ ID NO:l 82, SEQ ID NO: 1 83, SEQ 

15 ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ 
ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ 
ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ 
ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ 
ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ 

20 ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ 
ID NO:214, SEQ EDNO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ 
ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ 
ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ 
ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ 

25 ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ 

ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ ID NO:242. 
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9. The cell of claim 8, wherein said polypeptide is a fragment of a protein 



encoded by an amino acid sequence selected from the group consisting of SEQ ID 





NO:122, 


SEQ 


ID 


NO:123, 


SEQ 


ID 


NO:124, 


SEQ 


IDNO:125, 


SEQ 


lDNO:126, 


SEQ 


ID 




NO: 127, 


SEQ 


ID 


NO: 128, 


SEQ 


ID 


NO: 129, 


SEQ 


ID NO: 130, 


SEQ 


IDNO:131, 


SEQ 


ID 


5 


NO: 132, 


SEQ 


ID 


NO:133, 


SEQ 


ID 


NO: 134, 


SEQ 


ID NO: 135, 


SEQ 


ID NO: 136, 


SEQ 


ID 




NO:137, 


SEQ 


ID 


NO:138, 


SEQ 


ID 


NO:139, 


SEQ 


E)NO:140, 


SEQ 


ID NO:141, 


SEQ 


ID 




NO: 142, 


SEQ 


ID 


NO: 143, 


SEQ 


ID 


NO: 144, 


SEQ 


ID NO: 145, 


SEQ 


ID NO: 146, 


SEQ 


ID 




NO:147, 


SEQ 


ID 


NO:148, 


SEQ 


ID 


NO: 149, 


SEQ 


ID NO: 150, 


SEQ 


IDNO:151, 


SEQ 


ID 




NO: 152, 


SEQ 


ID 


NO: 153, 


SEQ 


ID 


NO: 154, 


SEQ 


ID NO: 155, 


SEQ 


ED NO: 156, 


SEQ 


ID 


10 


NO: 157, 


SEQ 


ID 


NO:158, 


SEQ 


ID 


NO: 159, 


SEQ 


ID NO: 160, 


SEQ 


EDN0:161, 


SEQ 


ID 




NO: 162, 


SEQ 


ID 


NO: 163, 


SEQ 


ID 


NO: 164, 


SEQ 


IDNO:165. 


SEQ 


ID NO: 166, 


SEQ 


ID 




NO: 167, 


SEQ 


ID 


NO: 168, 


SEQ 


ID 


NO: 169, 


SEQ 


ID NO: 170, 


SEQ 


EDNO:171, 


SEQ 


ID 




NO: 172, 


SEQ 


ID 


NO: 173, 


SEQ 


ID 


NO: 174, 


SEQ 


IDNO:175, 


SEQ 


ID NO: 176, 


SEQ 


ID 




NO: 177, 


SEQ 


ID 


NO:178, 


SEQ 


ED 


NO: 179, 


SEQ 


ID NO: 180, 


SEQ 


ID NO: 181, 


SEQ 


ID 


15 


NO: 182, 


SEQ 


ID 


NO:183, 


SEQ 


ID 


NO: 184, 


SEQ 


IDNO:185, 


SEQ 


IDNO:186, 


SEQ 


ID 




• NO:187, 


SEQ 


ID 


NO:188, 


SEQ 


ID 


NO: 189, 


SEQ 


ID NO: 190, 


SEQ 


IDN0.191, 


SEQ 


ID 




NO: 199, 


SEQ 


ID 


NO:193, 


SEQ 


ID 


NO: 194, 


SEQ 


ID NO: 195, 


SEQ 


ID NO: 196, 


SEQ 


ID 




NO: 197, 


SEQ 


ID 


NO:198, 


SEQ 


ID 


NO: 199, 


SEQ 


ID NO:200, 


SEQ 


LDNO:201, 


SEQ 


ID 




NO:202, 


SEQ 


ID 


NO:203, 


SEQ 


ID 


NO:204, 


SEQ 


ID NO:205, 


SEQ 


ID NO:206, 


SEQ 


ID 


20 


NO:207, 


SEQ 


ID 


NO:208, 


SEQ 


ID 


NO:209, 


SEQ 


IDNO:210, 


SEQ 


IDN0.211, 


SEQ 


ID 




NO:212, 


SEQ 


ID 


NO:213, 


SEQ 


ID 


NO:214, 


SEQ 


IDNO:215, 


SEQ 


IDNO:216, 


SEQ 


ID 




NO:217, 


SEQ 


ID 


NO:218, 


SEQ 


ID 


NO:219, 


SEQ 


ID NO:220, 


SEQ 


tDNO:221, 


SEQ 


ID 




NO:222, 


SEQ 


ID 


NO:223, 


SEQ 


ID 


NO:224, 


SEQ 


ID NO:225, 


SEQ 


DO NO:226, 


SEQ 


ID 




NO:227, 


SEQ 


ID 


NO:228, 


SEQ 


ID 


NO:229, 


SEQ 


ID NO:230, 


SEQ 


IDNO:231, 


SEQ 


ID 


25 


NO:232, 


SEQ 


ID 


NO:233, 


SEQ 


ID 


NO:234, 


SEQ 


ID NO:235, 


SEQ 


EDNO:236, 


SEQ 


ID 




NO:237, 


SEQ 


ID 


NO:238, 


SEQ 


ID 


NO:239, 


SEQ 


ID NO:240, 


SEQ 


IDNO:241, 


and SEQ 



ID NO:242. 
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1 0. An isolated, enriched, or purified kinase polypeptide selected from the 
group consisting of SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, 
SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, 
SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, 
5 SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: 1 38, SEQ ID NO: 139, SEQ ID NO: 1 40, 
SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, 
SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, 
SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, 
SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, 

10 SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. 
SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, 
SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, 
SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, 
SEQ ID NO: 1 81 , SEQ ID NO: 1 82, SEQ ID NO:l 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, 

15 SEQ ED NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO:l 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, 
SEQ ID NO. i 91 , SEQ ID NO : 1 99, SEQ ID NO: 1 93 , SEQ ID NO: 1 94, SEQ ID NO: 1 95, 
SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ IDNO:200, 
SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, 
SEQ ID NO:206, SEQ ID NO:207, SEQ ED NO:208, SEQ ID NO:209, SEQ ID NO:210, 

20 SEQ ED NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, 

SEQ ED NO:216, SEQ ID NO:217, SEQ ED NO:218, SEQ ID NO:21 9, SEQ ID NO:220, 
SEQ ED NO:221, SEQ ED NO:222, SEQ ED NO:223, SEQ ID NO:224, SEQ ID NO:225, 
SEQ ED NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, 
SEQ ED NO:231, SEQ ED NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, 

25 SEQ ED NO:236, SEQ ED NO:237, SEQ ED NO:238, SEQ ED NO:239, SEQ ID NO:240, 
SEQ ED NO:241, and SEQ ED NO:242. 
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1 1 . The polypeptide of claim 10, wherein said polypeptide is a fragment of the 
protein encoded by an amino acid sequence selected from the group consisting of SEQ ID 





NO: 122, 


SEQ 


ID NO: 123, 


SEQ 


ID 


NO: 124, 


SEQ 


ID 


NO:125, 


SEQ 


ID NO: 126, SEQ ID 




NO: 127, 


SEQ 


ID NO: 128, 


SEQ 


ID 


NO:129, 


SEQ 


ID 


NO: 130, 


SEQ 


IDNO:131,SEQID 


5 


NO: 132, 


SEQ 


IDNO:133, 


SEQ 


ID 


NO: 134, 


SEQ 


ID 


NO:135, 


SEQ 


ID NO: 136, SEQ ID 




NO: 137, 


SEQ 


ID NO: 138, 


SEQ 


ID 


NO:139, 


SEQ 


ID 


NO: 140, 


SEQ 


IDNO:141,SEQID 




NO: 142, 


SEQ 


IDNO:143, 


SEQ 


ID 


NO: 144, 


SEQ 


ID 


NO: 145, 


SEQ 


ID NO: 146, SEQ ID 




NO:147, 


SEQ 


ID NO: 148, 


SEQ 


ID 


NO: 149, 


SEQ 


ID 


NO: 150, 


SEQ 


EDNO:151,SEQ ID 




NO: 152, 


SEQ 


IDNO:153, 


SEQ 


ID 


NO: 154, 


SEQ 


ID 


NO:155, 


SEQ 


IDNO:156, SEQ ID 


10 


NO: 157, 


SEQ 


ID NO: 158, 


SEQ 


ID 


NO:159, 


SEQ 


ID 


NO: 160, 


SEQ 


IDNO:161,SEQ ID 




NO:162, 


SEQ 


ID NO: 163, 


SEQ 


ID 


NO: 164, 


SEQ 


ID 


NO:165. 


SEQ 


ID NO: 166, SEQ ID 




NO:167, 


SEQ 


ID NO: 168, 


SEQ 


ID 


NO: 169, 


SEQ 


ID 


NO: 170, 


SEQ 


IDNO:171,SEQ ID 




NO: 172, 


SEQ 


ID NO: 173, 


SEQ 


ID 


NO:174, 


SEQ 


ID 


NO:175, 


SEQ 


ID NO: 176, SEQ ID 




NO: 177, 


SEQ 


ID NO: 178, 


SEQ 


ID 


NO:179, 


SEQ 


ID 


NO:180, 


SEQ 


IDNO:181,SEQ ID 


15 


NO: 182, 


SEQ 


ID NO: 183, 


SEQ 


ID 


NO: 184, 


SEQ 


ID 


NO:185, 


SEQ 


ID NO: 186, SEQ ID 




NO: 187, 


SEQ 


ID NO: 188, 


SEQ 


ID 


NO:189, 


SEQ 


ID 


NO: 190, 


SEQ 


ID NO:191,SEQ ID 




NO: 199, 


SEQ 


ID NO: 193, 


SEQ 


ID 


NO: 194, 


SEQ 


ID 


NO:195, 


SEQ 


ID NO: 196, SEQ ID 




NO: 197, 


SEQ 


ID NO: 198, 


SEQ 


ID 


NO: 199, 


SEQ 


ID 


NO:200, 


SEQ 


n)NO:201,SEQID 




NO:202, 


SEQ 


ID NO:203, 


SEQ 


ID 


NO:204, 


SEQ 


ID 


NO:205, 


SEQ 


ID NO:206, SEQ ID 


20 


NO:207, 


SEQ 


IDNO:208, 


SEQ 


ID 


NO:209, 


SEQ 


ID 


NO:210, 


SEQ 


EDN0:211,SEQ1D 




NO:212, 


SEQ 


IDNO:213, 


SEQ 


ID 


NO:214, 


SEQ 


ID 


NO:215, 


SEQ 


ID N0:216, SEQ ID 




NO:217, 


SEQ 


IDNO:218, 


SEQ 


ID 


NO:219, 


SEQ 


ID 


NO:220, 


SEQ 


ID N0:221,SEQ ID 




NO:222, 


SEQ 


ID NO:223, 


SEQ 


ID 


NO:224, 


SEQ 


ID 


NO:225, 


SEQ 


ID NO:226, SEQ ID 




NO:227, 


SEQ 


ID NO:228, 


SEQ 


ID 


NO:229, 


SEQ 


ID 


NO:230, 


SEQ 


ID NO:231,SEQID 


25 


NO:232, 


SEQ 


1DN0:233, 


SEQ 


ID 


NO:234, 


SEQ 


ID 


NO:235, 


SEQ 


ID NO:236, SEQ ID 




NO:237, 


SEQ 


IDNO:238, 


SEQ 


ID 


NO:239, 


SEQ 


ID 


NO:240, 


SEQ 


IDNO:241,and SEQ 



ID NO:242. 

12. The polypeptide of claim 10, wherein said polypeptide comprises: 

(a) an amino acid sequence selected from the group consisting of SEQ 
30 ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ 
ED NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ED NO: 131, SEQ 
DO NO:132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ED NO: 136, SEQ 



WO 00/73469 



144 



PCT/US00/14842 



IDNO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO: 140, SEQ ID NO:141, SEQ 
ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ 
ID NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ 
ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ 
ID NO: 157, SEQ ID NO: 1 58, SEQ ED NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 61 , SEQ 
ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65. SEQ ID NO: 1 66, SEQ 
ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 171, SEQ 
ED NO: 1 72, SEQ ID NO: 1 73, SEQ ED NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ 
ID NO: 1 77, SEQ ED NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ 
ID NO:l 82, SEQ ED NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ 
ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ 
ID NO: 199, SEQ ID NO: 193, SEQ ED NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ 
ID NO: 1 97, SEQ ID NO: 1 98, SEQ ED NO: 1 99, SEQ ID NO:200, SEQ ID NO:201 , SEQ 
ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ 
ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ 
ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ 
ID NO:21 7, SEQ ID NO:2 1 8, SEQ ID NO:2 1 9, SEQ ID NO:220, SEQ ID NO:22 1 , SEQ 
ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ 
ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:23 1 , SEQ 
ID NO:232, SEQ ID NO:233, SEQ ED NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ 
ID N0.237, SEQ ED NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and 
SEQ ID NO:242; 

(b) an amino acid sequence selected from the group consisting of SEQ 
ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ED NO: 125, SEQ ED NO: 126, SEQ 
ID NO.l 27, SEQ ID NO: 1 28, SEQ ID NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ 
ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ 
ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ 
ID NO: 1 42, SEQ ID NO: 1 43, SEQ ED NO: 1 44, SEQ ID NO: 1 45, SEQ ID NO: 1 46, SEQ 
ID NO: 1 47, SEQ ID NO: 1 48, SEQ ED NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ 
ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ 
ID NO:l 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 61 , SEQ 
ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ 
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ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ 
ID NO:172, SEQ ID NO: 173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ 
ID NO: 1 77, SEQ ID NO: 1 78, SEQ ED NO: 1 79, SEQ ID NO : 1 80, SEQ ID NO: 1 8 1 , SEQ 
ID NO: 182, SEQ ID NO: 183, SEQ ID NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ 

5 ED NO: 187, SEQ ID NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ 
ID NO: 199, SEQ ID NO: 193, SEQ ED NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ 
ID NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, SEQ ID NO:200, SEQ ID NO:20 1 , SEQ 
ID NO:202, SEQ ID NO:203, SEQ ED NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ 
ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ 

10 ID NO:212, SEQ ED NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ 

ED NO:2 1 7, SEQ ED NO:2 1 8, SEQ ID NO:2 1 9, SEQ ID NO:220, SEQ ID NO:22 1 , SEQ 
ED NO:222, SEQ ED NO:223, SEQ ED NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ 
ID NO:227, SEQ ID NO:228, SEQ ED NO:229, SEQ ED NO:230, SEQ ED NO:23 1 , SEQ 
ID NO:232, SEQ ID NO:233, SEQ ED NO:234, SEQ ID NO:235, SEQ ED NO:236, SEQ 

1 5 ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ED NO:240, SEQ ED NO:24 1 , and 

SEQ ID NO:242, except that it lacks one or more, but not all of the domains selected from 
the group consisting of an N-terminal domain, a catalytic domain, a C-terminal domain, a 
spacer region, a proline-rich region, a coiled-coil structure region, and a C-terminal tail 
(c) a domain of an amino acid sequence selected from the group set 

20 forth in SEQ ED NO: 122, SEQ ID NO: 123, SEQ ED NO: 124, SEQ ID NO: 125, SEQ ID 
NO:126, SEQ ID NO:127, SEQ ED NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID 
NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID 
NO:136, SEQ ID NO:137, SEQ ED NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID 
NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID 

25 NO: 146, SEQ ED NO: 147, SEQ ID NO: 148, SEQ ED NO: 149, SEQ ID NO: 150, SEQ ID 
NO:l 51 , SEQ ID NO:152, SEQ ID NO.l 53, SEQ ID NO:l 54, SEQ ED NO:l 55, SEQ ID 
NO: 156, SEQ ID NO: 157, SEQ ED NO:158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID 
NO: 161, SEQ ED NO: 162, SEQ ED NO: 163, SEQ ED NO: 164, SEQ ID NO: 165. SEQ ED 
NO: 1 66, SEQ ID NO: 1 67, SEQ ED NO: 1 68, SEQ ED NO: 1 69, SEQ ED NO: 1 70, SEQ ID 

30 NO:171, SEQ ED NO: 172, SEQ ED NO:173, SEQ ED NO:174, SEQ IDNO:175, SEQ ID 
NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ED NO: 1 79, SEQ ID NO: 1 80, SEQ ID 
NO: 1 8 1 , SEQ ED NO: 1 82, SEQ ED NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID 
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NO: 186, SEQ ID NO: 187, SEQ ID NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID 
NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID 
NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID 
NO:201 , SEQ ID NO:202, SEQ ED NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID . 
NO:206, SEQ ID NO:207, SEQ ED NO:208, SEQ ED NO:209, SEQ ID NO:2l0, SEQ ID 
NO:21 1, SEQ ID NO:212, SEQ ED NO:213, SEQ ID NO:214, SEQ ED NO:215, SEQ ID 
NO:216, SEQ ED NO:217, SEQ ED NO:218, SEQ ED NO:219, SEQ ED NO:220, SEQ ID 
NO:221, SEQ ED NO:222, SEQ ED NO:223, SEQ ED NO:224, SEQ ED NO:225, SEQ ID 
NO:226, SEQ ED NO:227, SEQ ED NO:228, SEQ ID NO:229, SEQ ED NO:230, SEQ ID 
NO:231, SEQ ED NO:232, SEQ ED NO:233, SEQ ED NO:234, SEQ ED NO:235, SEQ ID 
NO:236, SEQ ID NO:237, SEQ ED NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID 
NO:241 , and SEQ ED NO:242 wherein said domain is selected from the group consisting 
of a C-terminal domain, a catalytic domain, an N-terminal domain, a spacer region, a 
proline-rich region, a coiled-coil structure region, and a C-terminal tail. 

13. The kinase polypeptide of claim 10, wherein said polypeptide is isolated, 
purified, or enriched from a mammal. 

14. The kinase polypeptide of claim 13, wherein said mammal is a human. 

15. The kinase polypeptide of claim 10, wherein said polypeptide is a 
AA144574, AA1 16841, AA2561 00, AA305176, AA210825, AA3 16804, AA980090, 
N42050, AA476563, AA626690, AA960957, HI 91 02, AA045601, AA107515, 

AA1 09508 or AA887783 polypeptide. 

1 6. The kinase polypeptide of claim 1 0, wherein said polypeptide is a H602 1 5, 
AA 197883, AA297313, W30246, AA 172300, AA383293, AA542015, H01 248, N23936, 
W44160, 2R22-5-1 1, 5R72-18-1, AA021445, AA207220, AA426580, AA544838, 
W90839, 5R79-54-1, AA839940, R 19772 or 5R72-8-2 polypeptide. 

1 7. The kinase polypeptide of claim 1 0, wherein said polypeptide is a 
AA234451 polypeptide. 

18. The kinase polypeptide of claim 10, wherein said polypeptide is a 5R65-16- 
1, AA061797, AA065538, AA124976, AA397553, AA435956, AA575635, AA626859, 
AA789239, A1086865, HI 7727, H29974, AA557536 or N28606 polypeptide. 

19. The kinase polypeptide of claim 10, wherein said polypeptide is a 
AA63 1 990 or W08549 polypeptide. 
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20. The kinase polypeptide of claim 10, wherein said polypeptide is a 5R72-16- 
2, R19927 or R43524 polypeptide. 

21. The kinase polypeptide of claim 10, wherein said polypeptide is a 5R57-10- 
2 polypeptide. 

22. The kinase polypeptide of claim 10, wherein said polypeptide is a 
AA232253 polypeptide. 

23. The kinase polypeptide of claim 10, wherein said polypeptide is a 
AA430250, AA836348, R86668 or N34132 polypeptide. 

24. The kinase polypeptide of claim 1 0, wherein said polypeptide is a 
AA098024or SuRTK106 polypeptide. 

25. The kinase polypeptide of claim 1 0, wherein said polypeptide is a R47805, 
AA099102, AA589241, H8581 1, AA013524, AA452647, AA840598, AA088547, 

AA1 39478, AA826850, R87679, W65887, H97685, W20810, AA599286, AA425725, 
AA103218, AA711829, AA060026, AA399669, AA758539, AA883975, AA948538, 
AA018361, AA21531 1, AA311714, AA498104, 5R69-17-2, 5R69-23-3, 5R69-26-2, 
AA118352, AA396601, AA671275, AA278842, AA460132 orH05721 polypeptide. 
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26. An antibody or antibody fragment having specific binding affinity to a 
kinase polypeptide selected from the group consisting of SEQ ID NO: 122, SEQ ID 
NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ED NO: 127, SEQ ID 
NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID 
5 NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID 
NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID 
NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID 
NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO:151, SEQ ID NO: 152, SEQ ID 
NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID 

10 N0.158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID 
NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65. SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID 
NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ID 
NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID 
NO:l 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID 

15 NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID 
NO:l 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ ID NO: 1 99, SEQ ID 
NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID 
NO:198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO.201, SEQ ID NO:202, SEQ ID 
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID 

20 NO:208, SEQ ID NO:209, SEQIDNO:210, SEQIDNO:211, SEQIDNO:212, SEQ ID 
NO:2 1 3, SEQ ID NO:2 1 4, SEQ ID NO:2 1 5, SEQ ID NO:2 1 6, SEQ ID NO:2 1 7, SEQ ID 
NO:218, SEQ ID NO:219, SEQ ID NO.220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID 
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID 
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID 

25 NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID 
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242. 

27. The antibody or antibody fragment of claim 26, wherein said polypeptide 
comprises: 

(a) an amino acid sequence selected from the group consisting of SEQ 
30 ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ 
ID NO: 1 27, SEQ ID NO: 1 28, SEQ ID NO: 1 29, SEQ ID NO: 1 30, SEQ ED NO: 1 3 1 , SEQ 
ID NO:132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ DD NO: 135, SEQ ID NO: 136, SEQ 
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ED NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ 
ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ 
ID NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ED NO: 1 50, SEQ ID NO: 1 5 1 , SEQ 
ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ 
5 ID NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 61 , SEQ 

ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ 
ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ 
ID NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ 
ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ 

1 0 ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO:l 86, SEQ 
ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ 
ID NO: 1 99, SEQ ID NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ 
ID NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, SEQ ID NO:200, SEQ ID NO:201 , SEQ 
ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ 

15 ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:2 1 0, SEQ ID NO:2 1 1 , SEQ 

ID NO:2 1 2, SEQ ID NO:2 1 3, SEQ ID NO:2 1 4, SEQ ID NO:2 1 5, SEQ ID NO:2 1 6, SEQ 
ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ 
ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ED NO:225, SEQ ID NO:226, SEQ 
ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:23 1 , SEQ 

20 ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ 
ID NO:237, SEQ ID NO:238, SEQ ED NO:239, SEQ ED NO:240, SEQ ED NO:241 , and 
SEQ ID NO:242; 

(b) an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1 22, SEQ ED NO: 1 23, SEQ ID NO: 1 24, SEQ ID NO: 1 25, SEQ ID NO: 1 26, SEQ 

25 ID NO: 1 27, SEQ ED NO: 128, SEQ ID NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ 
IDNO:132, SEQ ED NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ 
ED NO: 1 37, SEQ ED NO: 1 38, SEQ ID NO: 1 39, SEQ ID NO: 1 40, SEQ ID NO: 1 4 1 , SEQ 
ID NO: 142, SEQ ED NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ED NO: 146, SEQ 
ID NO: 1 47, SEQ ED NO: 1 48, SEQ ED NO: 1 49, SEQ ED NO: 1 50, SEQ ID NO: 1 5 1 , SEQ 

30 ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ 
ID NO: 157, SEQ ID NO: 158, SEQ ED NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ 
ED NO: 162, SEQ ED NO: 163, SEQ ED NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ 
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ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 71 , SEQ 
ID NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ 
ID NO: 177, SEQ ID NO: 178, SEQ ID NO: 179, SEQ ID NO: 180, SEQ ID NO:181 , SEQ 
ID NO: 182, SEQ ID NO: 183, SEQ ID NO: 184, SEQ ID NO: 185, SEQ ED NO: 186, SEQ 
ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ 
ID NO: 1 99, SEQ ID NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ 
ID NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ IDNO:200, SEQ ID NO:201, SEQ 
ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ 
ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:2 10, SEQ ID NO:2 1 1 , SEQ 
ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ 
ID NO:2 1 7, SEQ ID NO:2 1 8, SEQ ID NO:2 1 9, SEQ ID NO:220, SEQ ID NO:22 1 , SEQ 
ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ 
ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO.230, SEQ ID NO:231, SEQ 
ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ 
ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and 
SEQ ID NO:242, except that it lacks one or more, but not all, of the domains selected from 
the group consisting of a C-terminal domain, a catalytic domain, an N-terminal domain, a 
spacer region, a proline-rich region, a coiled-coil structure region, and a C-terminal tail. 

(c) a domain of an amino acid sequence selected from the group set 
forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID 
NO: 1 26, SEQ ID NO: 1 27, SEQ ID NO: 1 28, SEQ ID NO: 1 29, SEQ ID NO: 1 30, SEQ ID 

NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID 
NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID 
NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID 
NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID 
NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID 
NO:l 56, SEQ ID NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID 
NO: 1 6 1 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65. SEQ ID 
NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID 
NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID 
NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID 
NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID 
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NO:l 86, SEQ ED NO: 1 87, SEQ ID NO: 1 88, SEQ ED NO: 1 89, SEQ ED NO: 1 90, SEQ ED 
NO: 191, SEQ ED NO: 199, SEQ ED NO: 193, SEQ ED NO: 194, SEQ ED NO: 195, SEQ ED 
NO: 1 96, SEQ ED NO: 1 97, SEQ ED NO: 1 98, SEQ ED NO: 1 99, SEQ ID NO:200, SEQ ED 
NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ED NO:204, SEQ ED NO:205, SEQ ED 

5 NO:206, SEQ ED NO:207, SEQ ED NO:208, SEQ ID NO:209, SEQ ID NO:2 1 0, SEQ ID 
NO:21 1, SEQ ED NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ED 
NO:216, SEQ ED NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID 
NO:221, SEQ ED NO:222, SEQ ED NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID 
NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID 

10 NO:231, SEQ ID NO:232, SEQ ED NO:233, SEQ ED NO:234, SEQ ID NO:235, SEQ ID 
NO:236, SEQ ED NO:237, SEQ ED NO:238, SEQ ID N0.239, SEQ ID NO:240, SEQ ID 
NO:241, and SEQ ID NO:242 wherein said domain is selected from the group consisting 
of a C-terminal domain, a catalytic domain, an N-terminal domain, a spacer region, a 
proline-rich region, a coiled-coil structure region, and a C-terminal tail. 
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28. A hybridoma which produces an antibody having specific binding affinity 
to a kinase polypeptide selected from the group consisting of SEQ ID NO: 122, SEQ ID 
NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID 
NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID 

5 NO:133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID 
NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID 
NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID 
NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID 
NO:l 53, SEQ ID NO:l 54, SEQ ID NO: 1 55, SEQ ID NO:l 56, SEQ ID NO: 1 57, SEQ ID 

10 NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO:161, SEQ ID NO: 162, SEQ ID 

NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65. SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID 
NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ID 
NO: 1 73, SEQ ID NO:l 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID 
NO: 1 78, SEQ ED NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID 

15 NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID 
NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ ID NO: 1 99, SEQ ID 
NO:l 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID 
NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ E>NO:201, SEQ ID NO:202, SEQ ID 
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID 

20 NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID 
NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID 
NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID 
NO:223, SEQ ID NO:224, SEQ ED NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID 
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:23 1 , SEQ ID NO:232, SEQ ID 

25 NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID N0.236, SEQ ID NO:237, SEQ ID 
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242. 

29. A method for identifying a substance that modulates kinase activity 
comprising: 

(a) contacting a kinase polypeptide selected from the group consisting 
30 SEQ ID NO: 1 22, SEQ ID NO: 1 23, SEQ ID NO: 1 24, SEQ ID NO: 1 25, SEQ ID NO: 1 26, 
SEQ ID NO: 1 27, SEQ ID NO: 1 28, SEQ ID NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , 
SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, 
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NO:144, 


SEQ 


ID 


NO:145, 


SEQ 


ID 


NO: 146, 


SEQ 


ID 


NO:147, 


SEQ 


EDNO:148, 


SEQ 


ID 


NO:149, 


SEQ 


ID 


NO:150, 


SEQ 


ID 


NO:151, 


SEQ 


ID 


NO:152, 


SEQ 


IDNO:153, 


SEQ 


ID 


NO: 154, 


SEQ 


ID 


NO:155, 


SEQ 


ID 


NO: 156, 


SEQ 


ID 


NO:157, 


SEQ 


IDNO:158, 


SEQ 


ID 


NO: 159, 


SEQ 


ID 


NO:160, 


SEQ 


ID 


NO: 161, 


SEQ 


ID 


NO: 162, 


SEQ 


ID NO: 163, 


SEQ 


ID 


NO: 164, 


SEQ 


ID 


NO:165. 


SEQ 


ID 


NO: 166, 


SEQ 


ID 


NO: 167, 


SEQ 


ID NO: 168, 


SEQ 


ID 


NO: 169, 


SEQ 


ID 


NO: 170, 


SEQ 


ID 


NO:171, 


SEQ 


ID 


NO: 172, 


SEQ 


ID NO: 173, 


SEQ 


ID 


NO: 174, 


SEQ 


ID 


NO:175, 


SEQ 


ID 


NO: 176, 


SEQ 


ID 


NO: 177, 


SEQ 


ID NO: 178, 


SEQ 


ID 


NO: 179, 


SEQ 


ID 


NO: 180, 


SEQ 


ID 


NO:181, 


SEQ 


ID 


NO: 182, 


SEQ 


ID NO: 183, 


SEQ 


ID 


NO: 184, 


SEQ 


ID 


NO: 185, 


SEQ 


ID 


NO: 186, 


SEQ 


ID 


NO: 187, 


SEQ 


IDNO:188, 


SEQ 


ID 


NO: 189, 


SEQ 


ID 


NO:190, 


SEQ 


ID 


NO:191, 


SEQ 


ID 


NO: 199, 


SEQ 


IDNO:193, 


SEQ 


ID 


NO: 194, 


SEQ 


ID 


NO:195, 


SEQ 


ID 


NO: 196, 


SEQ 


ID 


NO: 197, 


SEQ 


ID NO: 198, 


SEQ 


ID 


NO: 199, 


SEQ 


ID 


NO:200, 


SEQ 


ID 


NO:201, 


SEQ 


ID 


NO:202, 


SEQ 


IDNO:203, 


SEQ 


ID 
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ID NO:208, 


SEQ 


ID 


NO:209, 


SEQ 


ID 


NO:210, 


SEQ 


ID 


NO:211, 


SEQ 


ID 


NO:212, 


SEQ 


IDNO:213, 


SEQ 
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NO:214, 
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NO:215, 
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NO:217, 
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and SEQ ID NO:242 with a test substance; 

(b) measuring the activity of said polypeptide; and 

(c) determining whether said substance modulates the activity of said 

25 polypeptide. 

30. A method for identifying a substance that modulates kinase activity in a 
cell comprising: 

(a) expressing a kinase polypeptide in a cell, wherein said polypeptide 
is selected from the group consisting of SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID 
30 NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID 
NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID NO: 1 33, SEQ ID 
NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138 3 SEQ ID 
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NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ED NO: 143, SEQ ID 
NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID 
NO: 149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ED NO: 153, SEQ ED 
NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID 
NO: 159, SEQ ID NO:160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID 
NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID 
NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ID NO: 1 73 , SEQ ID 
NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID 
NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID 
NO: 184, SEQ ID NO:l 85, SEQ ID NO: 186, SEQ ID NO: 187, SEQ ID NO: 188, SEQ ID 
NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ ID NO: 1 99, SEQ ID NO: 1 93, SEQ ID 
NO: 194, SEQ ID NO: 195, SEQ ED NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ ID 
NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ED 
NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID 
NO:209, SEQ ID NO:210, SEQ ED NO:21 1, SEQ ID NO:212, SEQ ED NO:213, SEQ ID 
NO:214, SEQIDNO:215, SEQEDNO:216, SEQE)NO:217, SEQDDNO:218, SEQ ID 
NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ED NO:222, SEQ ID NO:223, SEQ ID 
NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ED N0.227, SEQ ID NO:228, SEQ ID 
NO:229, SEQ ID NO:230, SEQ ID NO:23 1 , SEQ ID NO:232, SEQ ED NO:233, SEQ ID 
NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID 
NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ED NO:242; 

(b) adding a test substance to said cell; and 

(c) monitoring a change in cell phenotype or the interaction between 
said polypeptide and a natural binding partner. 
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31. A method for treating a disease or disorder by administering to a patient in 
need of such treatment a substance that modulates the activity of a kinase selected from 
the group consisting of SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID 
NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID 

5 NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ED NO: 133, SEQ ID NO: 134, SEQ ID 
NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID 
NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID 
NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID 
NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID 

10 NO: 155, SEQ ID NO: 156, SEQ ID NO:157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID 
NO: 1 60, SEQ ID NO: 1 61 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID 
NO: 1 65. SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID 
NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ED NO: 1 73, SEQ ID NO: 1 74, SEQ ID 
NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID 

15 NO: 1 80, SEQ DDN0:181,SEQ1DN0:182, SEQ ID NO: 1 83, SEQ ED NO: 1 84, SEQ ID 
NO:l 85, SEQ ID NO:l 86, SEQ ID NO:187, SEQ ID NO:l 88, SEQ ID NO:189, SEQ ID 
NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID 
NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID 
NO:200, SEQ ED NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID 

20 NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ED NO:208, SEQ ID NO:209, SEQ ID 
NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID 
NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID 
NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ED NO:223, SEQ ID NO:224, SEQ ID 
NO:225, SEQ ED NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID 

25 NO:230, SEQ ID NO:23 1 , SEQ ID NO:232, SEQ ID NO:233, SEQ ED NO:234, SEQ ID 
NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID 
NO:240, SEQ ID NO:241 , and SEQ ED N0.242. 

32. The method of claim 31, wherein said disease or disorder is selected from 
the group consisting of immune-related diseases and disorders, cardiovascular disease, 

30 neurodegenerative disorders, and cancer. 

33. The method of claim 3 1 , wherein said substance modulates kinase activity 

in vitro. 
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34. The method of claim 33, wherein said substance is a kinase inhibitor. 

35. A method for detection of a kinase polypeptide in a sample as a diagnostic 
tool for a disease or disorder, wherein said method comprises: 

(a) contacting said sample with a nucleic acid probe which hybridizes 
5 under hybridization assay conditions to a nucleic acid target region of a kinase polypeptide 
selected from the group consisting of SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, 
SEQ ID N0.125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, 
SEQ ID NO:l 30, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, 
SEQ ID NO: 1 35, SEQ ID NO: 136, SEQ ID NO: 1 37, SEQ ID NO: 1 38, SEQ ID NO: 1 39, 

10 SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, 

SEQ ID NO: 1 45, SEQ ID NO: 1 46, SEQ ID NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, 
SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54 , 
SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, 
SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, 

15 SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, 

SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, 
SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 178, SEQ ID NO: 179, 
SEQ ID NO:l 80, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID N0.184, 
SEQ ED NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, 

20 SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, 
SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, 
SEQ ID NO:200, SEQ ID NO:201 , SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, 
SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, 
SEQ ID NO:210, SEQ ID NO:21 1, SEQ ED NO:212, SEQ ID NO:213, SEQ ID NO:214, 

25 SEQ ED NO:2 1 5, SEQ ED NO:2 1 6, SEQ ED NO:2 1 7, SEQ ED NO:2 1 8, SEQ ID NO:2 1 9, 
SEQ ED NO:220, SEQ ED NO:221, SEQ ED NO:222, SEQ ED NO:223, SEQ ED NO:224, 
SEQ ED NO:225, SEQ ED NO:226, SEQ ED NO:227, SEQ ED NO:228, SEQ ID NO:229, 
SEQ ED NO:230, SEQ ED NO:23 1 , SEQ ED NO:232, SEQ ED NO:233, SEQ ID NO:234, 
SEQ ED NO:235, SEQ ID NO:236, SEQ ED NO:237, SEQ ED NO:238, SEQ ED NO:239, 

30 SEQ ED NO:240, SEQ ED NO:241, and SEQ EO NO:242, said probe comprising the 

nucleic acid sequence encoding said polypeptide, fragments thereof, or the complements 
of said sequences and fragments; and 
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(b) detecting the presence or amount of the probe:target region hybrid 
as an indication of said disease. 

36. The method of claim 35, wherein said disease or disorder is selected from 
the group consisting of immune-related diseases and disorders, cardiovascular disease, 

5 neurodegenerative disorders, and cancer. 

37. A method for detection of a kinase polypeptide in a sample as a diagnostic 
tool for a disease or disorder, wherein said method comprises: 

(a) comparing a nucleic acid target region encoding said kinase 
polypeptide in a sample, wherein said kinase polypeptide is selected from the group 

0 consisting of SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ 
ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ 
ID NO:131, SEQ ED NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ 
ID NO:136, SEQ ED NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ 
ID NO: 141, SEQ IDNO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO: 145, SEQ 

[5 DD NO: 146, SEQ ED NO: 1 47, SEQ ED NO: 1 48, SEQ ID NO: 1 49, SEQ ED NO: 1 50, SEQ 
ID NO:151, SEQ ED NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ 
ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ED NO: 159, SEQ ID NO: 160, SEQ 
ID NO: 1 6 1 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65 . SEQ 
ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ 

10 ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ 

DO NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ 
ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ 
ID NO: 1 86, SEQ ED NO: 1 87, SEQ ID NO: 1 88, SEQ ED NO: 1 89, SEQ ID NO: 1 90, SEQ 
E)NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ED NO:194, SEQ ID NO:195, SEQ 

15 ID NO: 1 96, SEQ ID NO: 1 97, SEQ ED NO: 1 98, SEQ ID NO: 1 99, SEQ ID NO:200, SEQ 
ID NO:201, SEQ ED NO:202, SEQ ED NO:203, SEQ ED NO:204, SEQ ED NO:205, SEQ 
ID NO:206, SEQ ED NO:207, SEQ ED NO:208, SEQ ED NO:209, SEQ ED NO:210, SEQ 
ID NO:21 1, SEQ ID NO:212, SEQ ED NO:213, SEQ ED NO:214, SEQ ED NO:215, SEQ 
ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID N0:219, SEQ ID NO:220, SEQ 

30 ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ 
ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ 
ID NO:231 , SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ 
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ID NO:236, SEQ ID NO:237, SEQ ED NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ 
ID NO:241, and SEQ ID NO:242, or one or more fragments thereof, with a control nucleic 
acid target region encoding said kinase polypeptide, or one or more fragments thereof; and 
(b) detecting differences in sequence or amount between said target 
5 region and said control target region, as an indication of said disease or disorder. 

38. The method of claim 37, wherein said disease or disorder is selected from 
the group consisting of immune-related diseases and disorders, cardiovascular disease, 
neurodegenerative disorders, and cancer. 
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FIGURE 1A 

SEO ID NO: 122 X69117_H BARK2_H 

MADLEAVIJ^VSYLMAMEKSKATPAARASKRIVLPEPSIRSVMQKYLiAERNEITFDKIFN 
OKIGFLLFKDFCLNEINEAVPQVKFYEEIKEYEKLDNEEDR-LCRSRQIYDAYIMKELLSC 
SHPFSKQAVEHVQSHLSKKQVTSTLFQPYIEEICESLRGDIFQKFMESDKFTRFCQWKNV 
ELNIHLTMNEFSVHRIIGRGGFGEVYGCRKADTGKMYAMKCLDKKRIKMKQGETLALNER 
TMLSLVSTGDCPFIVCMTYAFHTPDKLCFILDLMNGGDLHYHLSQHGVFSEKEMRFYATE 
IILGLEHVHNRFWYRDLKPANILLDEHGHARISDLGLACDFSKKKPHASVGTHGYMAPE 

VLOKGTAYDS SADWFSLGCMLFKLLRGH S PFRQHKTKDKHE I DRMTLTVNVELPDTF SPE 
LKSLLEGLLQRDVSKRLGCHGGGSQEVKEHSFFKGVDWQHVYLQKYPPPLIPPRGEVNAA 
DAFDIGSFDEEDTKGIKLLDCDQELYKNFPLVISERWQQEVTETVYEAVNADTDKIEARK 
RAKNKQLGHEEDYALGKDCIMHGYMLKLGNPFLTQWQRRYFYLFPNRLEWRGEGESRQNL 
LTMEQILSVEETQIKDKKCILFRIKGGKQFVLQCESDPEFVQWKKELNETFKEAQRLLRR 

APKFLNKPRSGTVELPKPSLCHRNSNGL 

9F0 ID NO- 123 AA144574_M BARK2_M 

cfvvyrdlkpanilIdeyghvrisdlglacdfskkkphasvgthgymapevlqkgtcyds 

SADWFSLGCMLFKLLRGHSPFRQHKTKDKHEIDRMTLTVNVQLPDAFSPELRSLLEGLLQ 
RDVSORLGCGGGGARELKEHIFFKGIDWQHVYLRKYPPPLIPPRGEVNAADAFDIGSFDE 

EDTKGI KLLDCDQDLYKNFPLVI SERWQQEWETI YDAVNADTDKI EARKKAKNKQLGQE 
EDYAMGKDCIMHGYMLKLGNPFLTQWQRRYFYLFPNRLEWRGEGESRQSLLTMEQIMSVE 
ETQI KDRKCILLRI KGGKQFVLQCESDPEFAQWLKELTCTFNEAQRLLRRAPKFLNKPRA 

AI LE FSKPPLCHRNS SGL 

SEO ID NO: 124 AA826850_H 

MGSSMSAATARRPVFDDKEDVNFDHFQILRAIGKGSFGKVCIVQKRDTEKMYAMKYMNKQ 
OCIERDEVRNVFRELEILQEIEHVFLVNLWYSFQDEEDMFMWDLLLGGDLRYHLQQNVQ 

FSEDTVRLY I CEMALALDYLRGQHI I HRDVKPDNI LLDERGHAHLTDFN I ATI I KDGERA 
TALAGTKPYMAPEIFXSFVNGGTGYSFEVDWWSVGVMAYELLRGWRPYDIHSSNAVESLV 
OLFSTVSVQYVPTWSKEMVALLRKLLTVNPEHRLSSLQDVQAAPALAGVLWDHLSEKRVE 
PGFVPNKGRLHCDPTFELEEMILESRPLHKKKKR^KNKSRDNSRDS^ 

AIQQDFVIFNRE* 



2KLKRSQDLPREPLPAPESRDAAEPVEDEAERSALPMCGPICPSAGSG 



qEO ID NO: 125 AA960957 H 

MGGNHSHKPPVFDENEEVNFDHFQILRAIGKGSFGKVCIVQKRDTKKMYAMKYMNKQKCI 
ERDEVRNVFRELQIMQGLEHPFLVNLWYSFQDEEDMFMWDLLLGGDLRYHLQQNVHFTE 

GTVKLYI CELALALE YLQRYH 1 1 HRD I KPDNI LLDEHGHVH I TDFN I AT WKGAERAS SM 
AGTKPYMAPEVFQVYMDRGPGYSYPVDWWSLGITAYELLRGWRPYEIHSVTPIDEILNMF 
KVERVHYSSTWCKGMVALLRKLLTKDPESRVSSLHDIQSVPYLADMNWDAVFKKALMPGF 
VPNKGRLNCDPTFELEEMILESKPLHKKKKRLAKNRSRDGTKDSCPLNGHLQHCLETVRE 
EFI I FNREKLRRQQGQGSQLLDTDSRGGGQAQSKLQDGCNNNLLTHTCTRGCSS 

mqstsnhlSllsd?^ 

KLNHKNIVKLFAIEEETTTRHKVLIMEFCPCGSLYTVLEEPSNAYGLPESEFLIVLRDVV 

SSwlrengivhrdikpgnimrvigedgqsvykltdfgaareleddeqfvslygteeyl 

Sera^rkdhqkkygatvdlwsigvtfyhaatgslpfrpfegprrnkevmy^^ 

Spsgaisgvqkaengpidwsgdmpvscslsrglqvlltpvlanileadqekcwgfdqffa 

eS^Sihvfslqqmtahkiyihsyntatifhelvykqtkiissnqeliyegrrlv 

lepgrSq^pktteenpifwsreplntigliyekislpkvhprydldgdasi^ 

wcSas?lll?qelmrkgirwlielikddy^ 
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FIGURE IB 

GGLRNVDCL 

SEQ ID NO: 127_AA305176_H SIVKPISRG AFGKVYLGQKGGKLYA 
TSlYGYFDEEMAVKYISEVAIAUDYLH^GIIHRDL^ knqdpaNILSACLS 

SsqStfxssvesechsspkwekdcqv 

PQPDDETDTSYFEARMNAQHLTVSGFSL 
X^RRSQHARKETEFLRLKRTRLGLDDFESL^IGR^ 

Sf?r^thbppsdfsfqnmhskr^™kw^Q^t SEKAKDLIIiR 

SEQ ID NO: 130_AA210825_H TQRLL p S TPKDPAlLRSPPPARSFLG 

KSSSSSSSS3S5SSSSS ° Q f«— 

CPPQDHDMQGLAERI SVL 

SEQ ID NO: "^^""^^LSQKTKVIKKTLTPTVmETFFVHFFEKTTLEL 

a,/ Hi 
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FIGURE 1C 



QGLKCEGCGLNYHKRCAFKI ^^^^Y^^pu^^^g y^px^CQYCKRLLKGLFRQGM 
EMILSSEKSRLPERITIO^ILVM^NLHFralVHC^K SGTFPFNEDE 

F °S™p~s^ 

LREF^TRI^E^ITHESDDARWEIHAYTHNLiVYPKHFIMAPNPDDMEEDP 
AAAI^SSPSEGWLRTKAKHQRGR^ 

IESLYCEKRILEAVGCTGHPFLLSLLVCFQTSSHAKt CKEG IGFGDRTSTFC 
ARFYVAC WLGLQFLHEKKI I ^^^^^ yemLVGEC PF PG^TEEEVFDC I VNMDAPY PG 




RLFAAELVL^CYLHDI^imRDV^ENI^RGH^ RDHVWttASV THSDSEIPAS 
SAETMPFDDFDCDLESFLLYPI PA 
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FIGURE ID 

SEO ID NO- 136 AA476563_H 

meffridskdsaseIlgldfgeklyslkseplkpfftlpdgdsasrsfntseskvefkaq 

riTI SRGSDDSVPVI SFKDAAFDDVSGTDEGRPDLLVNLPGELESTREAAAMGPTKFTQTN 
I GI I ENKLLEAPDVLCLRLSTEQCQAHEEKGI EELSDPSGPKS YS I TEKHYAQEDPRMLF 
VAAVDHSSSGDMSLLPSSDPKFQGLGWESAVTANNTEESLFRICSPLSGANEYIASTDT 
LKTEEVLLFTDQTDDLAKEEPTSLFQRDSETKGESGLVLEGDKEIHQIFEDLDKKLALAS 

RFY I PEGC I QRWAAEMWALDALHREG I VCRDLNPNNI LLNDRGH I QLTYFSRWS EVEDS 
CDSDAIERMYCAPEVGAITEETEACDWWSLGAVLFELLTGKTLVECHPAGINTHTTLNMP 

ECVSEEARSLIQQLLQFNPLERLGAGVAGVEDIKSHPFFTPVDWAELMR 
qEO ID NO- 137 AA626690_H 

MLPFAPQDEPWDREMEVFSGGGASSGEVNGLKMVDEPMEEGEADSCHDEGWKEIPITHH 
VKEGYEKADPAQFELLKVLGQGSFGKVFLVRKKTGPDAGQLYAMKVLKKASLKVRDRVRT 
KMERDILVEVNHPFIVKLHYAFQTEGKLYLILDFLRGGDVFTRLSKEVLFTEEDVKFYLA 
ELALALDHLHQLGIVYRDLKPENILLDEIGHIKLTDFGLSKESVDQEKKAYSFCGTVEYM 
APEWNRRGHSQSADWWSYGVLMFEMLTGTLPFQGKDRNETMNMILKAKLGMPQFLSAEA 
OSLLRMLFKRNPANRLGSEGVEEIKRHLFFANIDWDKLYKREVQPPFKPASGKPDDTFCF 

Speftaktpkdspglpasanahqlfkgfsfvatsiaeeykitpitsanvlpivqingnaa 

OFGEVYELKEDIGVGSYSVCKRCIHATTNMEFAVKI IDKSKRDPSEEIEILMRYGQHPNI 

itlkdvfddgryvylvtdlmkggelldrilkqkcfsereasdilyvisktvdylhcqgw 

HRDLKPSNILYMDESASADSIRICDFGFAKQLRGENGLLLTPCYTANFVAPEVLMQQGYD 
AACDIWSLGVLFYTMLAGYTPFANGPNDTPEEILLRIGNGKFSLSGGNWDNISDGAKDLL 
SHMLHMDPHQRYTAEQILKHSWITHRDQLPNDQPKRNDVSHWKGAMVATYSALTHKTFQ 

PVLEPVAASSLAQRRSMKKRTSTGL 
SEO ID NO: 138 AA215680_H 

MSLVACECLPSPGLEPEPCSRARSQAHVYLEQIRNRVALGVPDMTKRDYLVDAATQIRLA 
LERDVSEDYEAAFNHYQNGVDVLLRGIHVDPNKERREAVKLKITKYLRRAEEIFNCHLQR 
PLSSGASPSAGFSSLRLRPIRTLSSAVEQLRGCRWGVIEKVQLVQDPATGGTFWKSLP 
RCHMVSRERLTI IPHGVPYMTKLLRYFVSEDSIFLHLEHVQGGTLWSHLLSQAHSRHSGL 
SSGSTQERMKAQLNPHLNLLTPARLPSGHAPGQDRIALEPPRTSPNLLLAGEAPSTRPQR 
EAEGEPTARTSTSGSSDLPKAPGGHLHLQARRAGQNSDAGPPRGLTWVPEGAGPVLGGCG 
RGMDOSCLSADGAGRGCGRATWSVREEQVKQWAAEMLVALEALHEQGVLCRDLHPGNLLL 

DQAGHI RLTYFGQWSEVEPQCCGEAVDNLYSAPEVGG I SELTEACDWWSFGSLLYELLTG 
MALSQSHPSGIQAHTQLQLPEWLSRPAASLLTELLQFEPTRRLGMGEGGVSKLKSHPFFS 

TIQWSKLVG 

SEO ID NO- 13 9 SGK H 

MTVKTEAAKGTLTYSRMRGMVAILIAFMKQRRMGLNDFIQKIANNSYACKHPEVQSILKI 
SQPQEPELMNANPSPPPSPSQQINLGPSSNPHAKPSDFHFLKVIGKGSFGKVLLARHKAE 

E V FY AVKVLQKKA I LKKKEE KH IMS ERNVLLKNVKH P F LVGLHF S F QTADKL Y FVLD Y I N 
GGELFYHLQRERCFLEPRARFYAAEIASALGYLHSLNIVYRDLKPENILLDSQGHIVLTD 

FGLCKENIEHNSTTSTFCGTPEYLAPEVLHKQPYDRTVDWWCLGAVLYEMLYGLPPFYSR 
NTAEMYDNI LNKPLQLKPNI TNSARHLLEGLLQKDRTKRLGAKDDFME I KSHVFFSLI NW 
DDLINKKITPPFNPNVSGPNELRHFDPEFTEEPVPNSIGKSPDSVLVTASVKEAAEAFLG 

FSYAPPTDSFL 

cm ID NO- 140 AA107515 M 

MTVKAEAARSTLTYiRMRG^AlLlAFMKQRRMGLNDFIQKIASNTYACKHAEVQSILKM 

shpSp^anpspppspsqqinlgpssnphakpsdfhflkvigkgsfgkvlla^kae 
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FIGURE IE 

SYAPPVDSFL 

SEQ ID NO: 14 1_AA1 09508 M MT T yrdLKPENILLDCQGHWLTDFGLCKE 

ILDC 

====== 

======== 

VNASVLEADDAFVGFSYAPPSEDLFL 



LFGTVKDDLSFA' 



GYQKHLSSCAAPAPLTSAERELQQIRINEVKTEI 




AELTAEFL 



YDEVHPKQHAFKQAFAKPKGPGGKRGHKRLIRGPGENGDDS 



KDGTDDFYQLKILTLEERGDQGIESQEERM^^LSliHTQ „ 
RTCEIVEDTESSWKKmKRICLVL^C^ 

DAQPMTSLDTAI LAQRYLRK 
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FIGURE IF 

TOIKPENLLVCEYPDGTKSLKI^DFGlATVVEGPMTV^l SRKEIiISQM LQ 
GRRQVWPDCGAGLEVFELGSRELPSHGSWCLP 

SEQ ID NO: ^ 6 _ W30246 -^!^^GPELDRCLSPEGVNGNRCSESFPLLEKYR 
TKSSSSSPTSPGSFRGLKISAQGRSSSNWGGPELD^ 

I GKVI GDGNF AWKECVDRYTGKEF ALK ^^^^^^^^^YNLANALRY LH SL S I VHRD 
VEEMETATDLFLVMELVKGGDL^^^ 

TNl^LKNRGQYVAAGFERFHKLPPYQAFCLSVFRN^ 

Lltekvklqsgawlctleglplsag^lvtgh™^ 

GNEAHLRSGVGTVAGSPKPLGRKA^TCLIV^TLKyyy DVFIACG PEKFRYA 
PHRRKETAGALEVADDEDTQTEEPLDQRAA^ 

qddfvldhsrrrllrehqagfe^rrtrgeekea^ 

EPKTRPEENKPERPSGRKPRPMGII^ 
KRQKQVSPSSDGHFRSQHKRWEQVS 

SEQ ID NO: 148_AA197883 M CDHSL)KCLSS KISERKLPGPWLPAGR 
MPTAPVLRPPPPPATPAPPAPSRPAPP^GHRG^ 

GPLEKPVLGPRGAVMPLFSPQSSLHS^EHSPLKPR^ 

VQTFEQLLSDISEALGFPR^RVR^FTLKGREW 



KESKRKI^EKRPERPSG^PRPKGIIS^™^^ ETEfiEIYUME YVQGGDLFDA 




NSQKEESPNSLGHFQSOHKKVAEQMP 
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FIGURE 1G 

LFHPEETSSSSQTQDHSVRSSEDKTSKSSCNGTCGDREDKEN1PEDSSMVSKRFRFDDSL 
PNPHELVSDLLC 

QEO ID NO: 150 W44160_M DRAK2_M rRGKFAWRQClSKSTGQEYA 
^RRPDCRSVSGLiTTTPQTPIKT^™— 

AKSLKKRRRGQDCRAEILHEIAVLELARSCPHVl^ ILLESIYPLGD IKIVDF 



QETYLNI 
LFHPEET 
PSPHELVPDLFC 



SEQ ID NO: 151JM1248 H , DRAK1 H pQARGLLTE IRAWRTEPFQDGYS 
M I PLEKPGSGG S S ^^^^ ^^??^pmr^RKGQDCRME 1 1 HE I AVLELAQDNPW 

LCPGRELGRGKFAWRKC I ^^P^^? p^py^^g^p^KDVQRLMRQI LEGVHFLHTR 
VINLHEVYETASEMILVLEYA^GGE^DQCVADRE^A^ 

DVWLDLKPQNILLTSESPLGD^ 
S^TDMWSIGVLTYVMLTGISPFLGNDKQE^ 

IcHPHIIRLYQVMETERMIYLVTEY^ 
CHCR^IVHRDLKAENLLLDANLNIKIADFGFS^ 

SPPPNHPlONHLFRQPSNSPPPMSSAMIQPHGAASSSQFy SPSAG Q M QMQHRTNL 

v^tclgmqqpaqsqqvtiW 

MATLSYGHRPLSKQLSADSAEAHSI^RFSPANy £ p T p PDY TRHQQVPHILQ 

gvgfsptqalkvppldqfptfppsahqqpphyt^^ 

^ITWKRJGFVRIRGE^GNF^V^IHEW^^^^^^ 
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FIGURE 1H 

^^^^^^^^ 
RVYRGIRHTSKFCSIL 

SKIANELKL 

SEQ ID NO: l 55 - W90839 -^ nDm7rNYRLLRT IGKGNFAKVKLARHILTGREVAIK 

l^GKYRVPFYMSTDCESILRRFLV^PA^CTLEQIMKU STPNNLP p SMM TRRN 
RMPGRKASCSAVGSGSRGLPPSSP^SSA^^ 

SEQ ID NO: 156_4 06786 . 5 H TAEPSRSFS SAHRHLSRRNGLSRLCQS 
MEVGGLTVFEEDQRCLSQSLPLPVS AEGPAAQTTAEP S p R Qgyg^Q gLLRGLS SGW S S 
RTALSEDRWSSYCLSSL^QNICTS^P^PEHTDPS QFFLRSDSDV VE 

p LL PAPVCNPNKAIFTVDAKTTE ^^^^KRM RQE RRLCCVVVLEPVERVST 

ALSEEHMEADGHAAWFGTVWIITRSGEKIPV^ 

^AFQSDGTITSCDSLFAHLHGYVSGEDVAGQHIT^I^ 

GRARDGTTFPLSLKLKSQPSSEEATTGE^ 

NHSFALTLFGYGKTELLGKNITFLIPGFYSYMDIA^ SQDIFTGTQT ELIAGGQLLSC 

8/10 
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FIGURE II 

HPGDPRLLTS 

SEQ ID NO: 157_RA544838 M 4061 "-^ viXedftiK I,IDFGSAML 

ergk^tf^tieyc™ 

GLLS^S^^GSRSPSEM^REGLCGPPAPRETRGDQHCLHLKDPSLPVS 

^pS^S 

"" rT ' lc, ^ n ' A ' - iGEFNEQV 



AEQTVAKAQTVGLPVTMHSPNM^ 

PK^GCLLDPVPPVLWKGCQSLPS^ETSIMGLE^ 
WTCTLSEVTNQLVVM^ 

ANQP S PRMTS PF I ^^^^^^?^Qp^^^^^pAAPQLQDLAS S C PQEEV SQQQESVST 
HLQNIARTKGILELNKVQLLYEQIGPEADP^ 

LPASVHPQLSPRQSLETQYLQHRLQKPSLLSKAQN1 u QQA pp FSLT QPLSPVLEP 
RLFLQKQSQLQAYFNQMQIAESSYPQPSO^PL^ 

pScS 

SELPGLFDCEMLDAVDPQHNGYVLVN 

SEQ ID NO: 159_AA207220 H KKQAVKRHH HKHNLRHRYEFLETLG 
MESLVFARRSGPTPSAAEI^RP^EGLIKSPKPLMK ^ LNHPHIIAIH EVFE 

KGTYGKVKKARESSGRLVAIKS^ 

NSSKIVIVMEYASRGDLYDYISERQQL^ 

ILLDANGNIKIADFGLSNLYHQGKFLQTFW 

YILVHGTMPFDGHDHKILVKQISNG^ 
HWWVNWGYATRVGEQEAPHEGGHPGS^DSARAS 

LGDSCFSLTDCQEVTATYRQALRVCSKLT 

SEQ ID NO: ^0_AA426580 H, pAWVS6VPRERLR DFQHHKRVGN 
MPAAAGDGLLGEPAAPGGGGGAED^RPA^^ 
YLIGSRKLGEGSFAlOmEGLHyLTGE^AI^ 
PNITQLIJDIl^TENSYYL^ELCPGGNLMHKIYE^ 

GWHRDLKIENLLLDEDNN ^ ^^^pppg^^YQK^DKEI^PLPTQLSTGAI S FLRSL 

V H!) 
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FIGURE 1J 

DVINTVLSNRACHILAIYFLLNKKLERYLSGKSDIQDSLCYKTRLYQIEKYRAPKESYEA 
SLDTWTRDLEFHAVQDKKPKEQEKRGDFLHRPFSKKLDKNLPSHKQPSGSLMTQIQNTKA 
LLKDRKASKSSFPDKDSFGCRNIFRKTSDSNCVASSSMEFIPVPPPRTPRIVKKPEPHQP 
GPGSTGIPHKEDPLMLDMVRSFESVDRDDHVEVLSPSHHYRILNSPVSLARRNSSERTLS 
PGLPSGSMSPLHTPLHPTLVSFAHEDKNSPPKEEGLCCPPPVPSNGPMQPLGSPNCVKSR 
GRFPMMGIGQMLRKRHQSLQPSADRPLEASLPPLQPLAPVNLAFDMADGVKTQC 

SEQ ID NO: 161_Z36720_H 

MDTKLNMLNEKVDQLLHFQEDVTEKLQSMCRDMGHLERGLHRLEASRAPGPGGADGVPHI 
DTQAGWPEVLELVRAMQQDAAQHGARLEALFRMVAAVDRAIALVGATFQKSKVADFLMQG 
RVPWRRGSPGDSPEEWVKEEEVCFMPPVPPAPGAAGQSLQKDKGELSAEQGIWATLMTLV 
IMVTAANKERVEEEGGKPKHVLSTSGVQSDAREPGEESQKADVLEGTAERLPPIRASGLG 
ADPAQAWSPGQGDGVPGPAQAFPGHLPLPTKVEAKAPETPSENLRTGLELAPAPGRVNV 
VSPSLEVAPGAGQGASSSRPDPEPLEEGTRLTPGPGPQCPGPPGLPAQARATHSGGETPP 
RAALLKGAVAPGFSRRDLVFPSIFCACLGISIHIQEMDTPGEMLMTGRGSLGPTLTTEAP 
AAAQPGKQGPPGTGRCLQAPGTEPGEQTPEGARELSPLQESSSPGGVKAEEEQRAGAEPG 
TRPSLARSDDNDHEVGALGLQQGKSPGAGNPEPEQDCAARAPVRAEAVRRMPPGAEAGSV 
VLDDSPAPPAPFEHRWSVKETSISAGYEVCQHEVLGGGRFGQVHRCTEKSTGLPLAAKI 
IKVKSAKDREDVKNEINIMNQLSHVNLIQLYDAFESKHSCTLVMEYVDGGELFDRITDEK 

YHLTELDWLFTRQ I CEGVH YLHQHY I LHLDLKPEN I LCVNQTGHQ I KI I DFGLARRYKP 
REKLKVNFGTPEFLAPEWNYEFVSFPTDMWSVGVITYMLLSGLSPFLGETDAETMNFIV 
NCSWDFDADTFEGLSEEAKDFVSRLLVKEKSCRMSATQCLKHEWLNNLPAKASRSKTRLK 

SQLLLQKYIAQRKWKKHFYWTAANRLRKFPTSP 
SEQ ID NO: 162 SGK088_H 

GEMALFECLVAGPTDVEVDWLCRGRLLQPALLKCKMHFDGRKCKLLLTSVHEDDSGVYTC 

KLSTAKDELTCSARLTVRPSLAPLFTRLLEDVEVLEGRAARFDCKISGTPPPWTWTHFG 

CPMEESENLRLRQDGGLHSLHIAHVGSEDEGLYAVSAVNTHGQAHCSAQLYVEEPRTAAS 

GPSSKLEKMPSIPEEPEQGELERLSIPDFLRPLQDLEVGLAKEAMLECQVTGLPYPTISW 

FHNGHRIQSSDDRRMTQYRDVHRLVFPAVGPQHAGVYKSVIANKLGKAACYAHLYVTDW 

PGPPDGAPQWAVTGRMVTLTWNPPRSLDMAIDPDSLTYTVQHQVLGSDQWTALVTGLRE 

PGWAATGLRKGVQHIFRVLSTTVKSSSKPSPPSEPVQLLEHGPTLEEAPAMLDKPDIVYV 

VEGQPASVTVTFNHVEAQWWRSCRGALLEARAGVYELSQPDDDQYCLRICRVSRRDMGA 

LTCTARNRHGTQTCSVTLELAEAPRFESIMEDVEVGAGETARFAVWEGKPLPDIMWYKD 

EVLLTESSHVSFVYEENECSLWLSTGAQDGGVYTCTAQNLAGEVSCKAELAVHSAQTAM 

EVEGVGEDEDHRGRRLSDFYDIHQEIGRGAFSYLRRIVERSSGLEFAAKFI PSQAKPKAS 

ARREARLLARLQHDCVLYFHEAFERRRGLVIVTELCTEELLERIARKPTVCESEIRAYMR 

QVLEGIHYLHQSHVLHLDVKPENLLVWDGAAGEQQVRICDFGNAQELTPGEPQYCQYGTP 

EFVAPEIVNQSPVSGVTDIWPVGWAFLCLTGISPFVGENDRTTLMNIRNYNVAFEETTF 

LSLSREARGFLIKVLVQDRLRPTAEETLEHPWFKTQAKGAEVSTDHLKLFLSRRRWQRSQ 

ISYKCHLVLRPIPELLRAPPERVWVTMPRRPPPSGGLSSSSDSEEEELEELPSVPRPLQP 

EFSGSRVSLTDIPTEDEALGTPETGAATPMDWQEQGRAPSQDQEAPSPEALPSPGQEPAA 

GASPRRGELRRGSSAESALPRAGPRELGRGLHKAASVELPQRRSPGPGATRLARGGLGEG 

EYAQRLQALRQRLLRGGPEDGKVSGLRGPLLESLGGRARDPRMARAASSEAAPHHQPPLE 

NRGLQKSSSFSQGEAEPRGRHRRAGAPLEIPVARLGARRLQESPSLSALSEAQPSSPARP 

SAPKPSTPKSAEPSATTPSDAPQPPAPQPAQDKAPEPRPEPVRASKPAPPPQALQTLALP 

LTPYAQIIQSLQLSGHAQGPSQGPAAPPSEPKPHAAVFARVASPPPGAPEKRVPSAGGPP 

VLAEKARVPTVPPRPGSSLSSSIENLESEAVFEAKFKRSRESPLSLGLRLLSRSRSEERG 

PFRGAEEEDGIYRPSPAGTPLELVRRPERSRSVQDLRAVGEPGLVRRLSLSLSQRLRRTP 

PAQRH PAWEARGGDGES SEGGS SARGS PVLAMRRRLS FTLERLS SRLQRSG S SEDSGGAS 
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FIGURE IK 

GRSTPLFGRLRRATSEGESLRRLGLPHNQLAAQAGATTPSAESLGSEASATSGSSAPGES 

RSRLRWGFSRPRKDKGLSPPNLSASVQEELGHQYVRSESDFPPVFHIKLKDQVLLEGEAA 

TLLCLPAACPAPHISWMKDKKSLRSEPSVIIVSCKDGRQLLSIPRAGKRHAGLYECSATN 

VLGSITSSCTVAVARVPGKLAPPEVTQTYQDTALVLWKPGDSRAPCTYTLERRVDGESVW 

HPVSSGIPDCYYNVTHLPVGVTVRFRVACANRAGQGPFSNSSEKVFVRGTQDSSAVPSAA 

HQEAPVTSRPARARPPDSPTSLAPPLAPAAPTPPSVTVSPSSPPTPPSQALSSLKAVGPP 

PQTPPRRHRGLQAARPAEPTLPSTHVTPSEPKPFVLDTGTPIPASTPQGVKPVSSSTPVY 

WTSFVSAPPAPEPPAPEPPPEPTKVTVQSLSPAKEWSSPGSSPRSSPRPEGTTLRQGP 

PQKPYTFLEEKARGRFGWRACRENATGRTFVAKIVPYAAEGKPRVLQEYEVLRTLHHER 

IMSLHEAYITPRYLVLIAESCGNRELLCGLSDRFRYSEDDVATYMVQLLQGLDYLHGHHV 

LHLDIKPDNLLLAPDNALKIVDFGSAQPYNPQALRPLGHRTGTLEFMAPEMVKGEPIGSA 

TDIWGAGVLTYIMLSGRSPFYEPDPQETEARIVGGRFDAFQLYPNTSQSATLFLRKVLSV 

HPWSRPSLQDCLAHPWLQDAYLMKLRRQTLTFTTNRLKEFLGEQRRRRAEAATRHKVLLR 

SYPGGP 

SEQ ID NO: 163_AA542015_M SGK088_M 

ATDIWGAGVLTYIMLSGYSPFYEPDPQETEARIVGGRFDAFQLYPNTSQSATLFLRKVLS 
VHPWSRPSLQDCLAHPWLQDAYLMKLRRQTLTFTTNRLKEFLGEQRRRRAEAATRHKVLL 

RSYPGSP 

SEQ ID NO: 164_R19772_H 

MKGGDRAYTRGPSLGWLFAKCCCCFPCRDAYSHSSSENGGKSESVANLQAQPSLNFIHSS 
PGPKRSTNTLKKWLTSPVRRLNSGKADGNIKKQKKVRDGRKSFDLGSPKPGDETTPQGDS 
ADESKKGWGEDEPDEESHTPLPPPMKIFDNDPTQDEMSSSLLAARQASTEVPTAADLVNA 
IEKLVKNKLSLEGSSYRGSLKDPAGCLNEGMAPPTPPKNPEEEQKAKALRGRMFVLNELV 
QTEKDYVKDLGIWEGFMKRIEEKGVPEDMRGKDKIVFGNIHQIYDWHKDFFLAELEKCI 

OEQDRLAQLF I KHERKLH I YVW YCQNKPRS EY I VAEYDAY FEE VKQE I NQRLTLSDFL I K 
PIORITKYQLLLKDFLRYSEKAGLECSDIEKAVELMCLVPKRCNDMMNLGRLQGFEGTLT 
AQGKLLQQDTFYVIELDAGMQSRTKERRVFLFEQIVIFSELLRKGSLTPGYMFKRSIKMN 
YLVLEENVDNDPCKFALMNRETSERWLQAANADIQQAWVQDINQVLETQRDFLNALQSP 
IEYQRKERSTAVMRSQPARLPQASPRPYSSVPAGSEKPPKGSSYNPPLPPLKISTSNGSP 
GFEYHQPGDKFEASKNDLGGCNGTSSMAVI KDYYALKENE I CVSQGEWQVLAVNQQNMC 
LVYQPASDHSPAAEGWVPGSILAPLTKATAAESSDGSIKKSCSWHTLRMRKRAEVENTGK 
NEATGPRKPKDILGNKVSVKETNSSEESECDDLDPNTSMEILNPNFIQEVAPEFLVPLVD 
VTCLLGDTVILQCKVCGRPKPTITWKGPDQNILDTDNSSATYTVSSCDSGEITLKICNLM 
PODSGIYTCIATNDHGTTSTSATVKVQGVPAAPNRPIAQERSCTSVILRWLPPSSTGNCT 
ISGYTVEYREEGSQIWQQSVASTLDTYLVIEDLSPGCPYQFRVSASNPWGISLPSEPSEF 
VRLPEYDAAADGATISWKENFDSAYTELNEIGRGRFSIVKKCIHKATRKDVAVKFVNKKM 
KKKEQAAHEAALLQHLQHPQYITLHDTYESPTSYILILELMDDGRLLDYLMNHDELMEEK 

VAFY I RD I MEALQYLHNCRVAHLD I KPENLL I DLR I PVPRVKL I DLEDAVQ ISGHFHIHH 
LLGNPEFAAPEVIQGIPVSLGTDIWSIGVLTYVMLSGVSPFLDESKEETCINVCRVDFSF 
PHEYFCGVSNAARDFINVILQEDFRRRPTAATCLQHPWLQPHNGSYSKIPLDTSRLiACFI 

ERRKHQNDVR P I PNVKSYI VNRVNQGT 
SEO ID NO: 165 5R72_8_2_H 

MADSGLDKKSTKCPDCSSASQKDVLCVCSSKTRVPPVLWEMSQTSSIGSAESLISLERK 
KEKNINRDITSRKDLPSRTSNVERKASQQQWGRGNFTEGKVPHIRIENGAAIEEIYTFGR 

I LGKGS FG I VI EATDKETETKWAI KKVNKEKAGS S AVKLLEREVN I LKS VKHEH 1 1 HLEQ 
VFETPKKMYLVMELCEDGELKEILDRKGHFSENETRWIIQSLASAIAYLHNNDIVHRDLK 
LENI MVKS £ L I DDNNE I NLN I KVTDFGLAVKKQSRSEAMLQATCGTP I YMAPEV I S AHDY 
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FIGURE 1L 

SOOCDIWSIGWMYMLLRGEPPFLASSEAKLFELIRKGELHFENAVWNSISDCAKSVLKQ 
LM^PAHRITAKELLDNQWLTGNKLSSVRPTNVLEMMKEWKNNPESVEENTTEEKNKPS 

teSSk^qpwgnvpetnytsdeeeekqstayekqfpatskdnfdmcsssftsskllpae 
I kgemektpvtpsqgtatkypaksgalsrtkkkl 

^NYVVMQLQGRNLADLRRSQPRGTFTLSTTLRLGKQI LES I EAIHSVGFLHl^IKPSNF 

amgIlpstyrkcymldfglarqytnttgdvrpprnvagfrgtvryasvnahknremgrhd 

DLWSLFYMLVEFAVGQLPWRKI KDKEQVGMI KEKYEHRMLLKHMPSEFHLFLDH I ASLDY 

^tkpdyqlimsvfensmkergiaeneafdwekagtdallstststpppaehpadgshw^ 
gqcdasawgpLgehrgcatgrapexpgectpnsarealxgagpqsppcppprgsxgx 

pvpspalaslclpsssssvsftlrrpsa 

msgggeqldi ls vgT^kerwkvlrki ggggfge I ydaldmltrenvalkvesaqqpkqv 
SeSaSlkklqgkdhvcrfigcgrndrfnyvvmqlqgrni^dlrrsqsrgtftist 

iSoiiESIESIHSVGSXHRDIKPSNFAMGRFPSTCRKCYMLDFGLARQFTNSCGDVRPP 

RtvAGFRGTVRksiNAHRNREMGRHDDLWSLFYMLVEFWGQLPWRKIKDKEQ 

RYDHRLMLKHLPPEFSIFLDHISSLDYFTKPDYQLLTSVFDNSIKTFGVIESDPFDWEKT 

gnSgslttSts??pqlhtrltpaaigianatpipgdllren 
vgvsId^pgslghprpqekdvweemd^ 

gsp^rvSeitqpdrdiplvrklrsihsfelekrltlepkpdtdkfletwykivyfsf 



^^iffemtvfdleaksarggsnIlmdsvssfqlfmfqllrglayihhqhvlhrdlkpqn 
llishlgelki^fgiaraksipsqtyssewtlwyrppdallgateysseldiwgagci 

fiemfqgqplftcvsnileqlekiwevlgvptedtwpg 

^rSrSpSedlasqmlkgfprdrvsaqealvhdyfsalpsqlyqlpdeeslftvsgv 

RLKPEMCDLLASYQKGHHPAQFSKCW 

A^ATRWYRAPELLVGDTQYGSSVDIWAIGCVFAELLTGQPLWPGKSDVDQLYLIIR 

tSkSprhqsifksngffhgisipepedmetleekfsdvhpvalnfmkgclkmnpddrl 

TCSQLLESSYFDSFQEAQIKRKARNEGRNRRRQ 
FDHLPNI 

KTALREIR^LKL^P^WLIEVFRRKRKMHLVFE 

SotlqaSfS^cihrdvkpenilitkqgmikicdfgfarili^ 
^elS^Sg^vdwavgcvfaelltgqplwpgksdvdqlyliirtlgkliprh^ 
^rISqffrLsipepedmetleekfsnvqpvalsfmkgclkmnpderltcaqlldsayf 
esfqSqmkrkarsegrsrrrqqnqllplipgshisptpdgrkqwqlkfdhlpni 

MPNqFRHGGJaCDGSGG^SGTLQPSSGGGSSNS 
ASLG?v?KP^?DD?sLsD?FSDDMAFKLDRRENDE 

it/" 3 
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FIGURE 1M 

KEKTRKERELKSGHKDRSKSHRKRETPKSYKTVDSPKRRbK RSpS p YS RRQR 
mIrHSSSHSKKKKSSSRSRHSSI^ 

IStdlplppelpggdlsppdspepkaitppqqpvktopkicc eikilrq 
™fdiigiigegtygq = ™ 

T T WRSVVNMKEI VTDKQDALDFKKDKGAFYLV r b i ^rlynsEESRPYTNKVITLW 

yrppelllgeerytpaidwscgcilgelftkkpifq^ SKRCTAEQTL qsdfl 
pdv?Spyfntmkpkkqyrrrlree^ 

KDVELS 




Q^^PMEYSTRPRP^^^^^^^p^^^p^^Qp^l^EGSSNSVVHAETKLQN^ 

SSEGGLGQQDME^PMSPDT^^rara^^^pQpjQSQMEKGI 
NLTOSHLM^LSSNLFHPSWLTER^RTSSQSIGQ MEVKQIKMLKRESKK 

skiptllsmdpnqekqeggdgdcegknlkrnrfffsr 

qEO ID NO: 174 AA575635_M CCRK -^ 1V _- pf1jLYGA rqyDQGVDLWAVGCIMG 
S ASGQLKI ADFGLARVFS PDGGRLYTHQVATRWYRAPELL^ pEEQAPVPLEEVLP 

Ssplfpgendieql™ 

da POALDLLGQFLLY P PRQRI AAS QALLHQ i r 

gppXfhvdrpieesllnpelirpfipeg 
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FIGURE IN 



SSSKE5S 

DTLGEGAFGKWECIDHG1C.GMHVAVKIVKNVGRYREAWSEK NFLHHNKIj 
VQMLEWFDHHGHVCIVFELLGLSTYDFI^SF^I^RO^™ HHSTLVSTRH 

RITLDEALQHPFFDLLKKK 

PEDQAVTEYVATRWYRAPEVLLS ^^T^f ppnTS PEALDLLRRLLVFAPDKRLSATQAL 
STLHQLELILETIPPPSEEXRPRQTLDALLPPDTSPEAL^ 

QHPYVQRFHCPSDEWAREADVRPRAHEGVQLSVPEYRS^ 

GVSPSQAHLHKPRADPQLPSRTPVQGPRPRP^SPGHD^ 



EGHHV 



SEQ ID NO: ^ c^vM^Mn^LRDGNYYACKQMKQRFES I EQVNNLRE I QALRRLNPH 

MKNYKAIGKIGEGTFSEVM^QSLRDGNYYACKQM^ 

PNILMLHEWFDRKSGSlALICELMD^IYELIRGRRYPb ECL LTDGFYT 

Lgifhrdvkpenilikqdvlklgd^gsc^ 



SEQ ID NO: i" 78 -^ 0231 "-"' ^7tv KMKRKFYSWEECMNQREVKSLKKLNHA 

mnryttirqlgdgtygsvllgrsiesgeliaik^kfysw^ QGLAFIH klg 

QPRSTPGLI PRPPAAQPVHGRTDWASKYPSRR 
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FIGURE 10 

OTGHQIKIIDFGIJ^RYKPREKLKVNFGTPEFLAPE\AmYEFVSFPTDMWSVGVITYMLL 
SGLSPFLGETDAETMNFIVNCSWDFDADTFKGLSEEAKDFVSRLLVKEKSCRMSATQCLK 

HEWLNHLP AKAS G SNVRLRSQQLLQKYMAQ S KWKKHFHWAAVNRLRKF PTCP 

SEO ID NO: 180 AA460132_H nnm „ WTV 
^A^TTPADGEEiAPEAEAL^ERSSRFLSGLELVKQGAEARVFRGRFQGR^VIK 

Xp^yLpalfarlgrrrtvqearallrcrragisapwffvdyasnclymeeie^ 

?WDYICTETEKTPQGLSNLAKTIGQVLARMHDEDLIHGDLTTSNMLLKPPLEQLNIV 

™?glsfSSpedkgvdlyvlekaflsthpntetvfeaf 
evrlrgrkrsmvg 



OftEKWQGNMPGLQSTFLAMDTEEGVEVVVWELHFGDRKAFA 

?n?™kywldtseacarvifiteyvssgslkqflkktkknhkamnarawkrwctqils 
nlSfSe?ge™g?avdifsfg M calemavle 

refI^cSrdparrpsahsllfhrvlfevhslkli^cfiqhqylmpenweekt^ 

?pe^vo^k?ptpepfdsetrkviqmqcnlersedkar W hltlllvledrlhrqltydll 
ptdsaqdlaselvhygflheddrmklaaflestflkyrgtqa 

cx?n td NO- 182 AA103218 M SGK034 M 

APE YGEVNDGTGFVDI FS FGMCALEMAVLE I QANGDTRVTEEAI ARARHS LSDPNMR 

S^lscSrd^rrpsahnllfhrvlfevhslkliaahcfiqhqylmpenweekt^ 

f^V^POPHGPPMQWRYSEVSFLELDKFLEDVRNGIYPLMNFAAARPLGLPRVLAPP 

pe^Iq^SSepfdsetrkwqmqcnlersedkarwhlt 

TDSAQDLAAELVHYGFLHEDDRTKLAAFLETTFLKYRGTQA 

S^g^Stt^ehrffrrsvicdsnatalelpglplslpqpsip^vpqsappeph 

ARqrqGGGSAKEPQEERSQQQDDIEELETKAVGMSNDGRFLKFDIEIGRGSFKTVYKGLD 
^TTVEVAWCELQDRKLTKSERQRFKEEAEMLKGLQHPNI WFYDSWESTVKG 

™Sgtlk?ylkrfkvmkikvlrsw 

PYSECQNAAQIYRRVTSGVKPASFDKVAI PEVKEI IEGCIRQNKDERYSIKDLLNHAFFQ 

eSgwve^eLdgekiaiklwlriedikklkgkykdneaiefcfdlerd^ 
vIsgy^Sg^hkt^kaikdrvslikrkreqrqlvreeqenkkqeessl^ 
tc^pI^ptasttsasvstqvepeepemwhcxjwyqqpsisvlsixstvdsgqg 
^SesrvIsqUsygfpxheqahstgtvpghipstvqaqsqphgvyppssvqqgiqq 

?appqqSq?sSqStsseattaqpvsqpqap 

IqpXp?tlassvdsahsdvasgmsdgnenvpsssgrhegrttkrhy = 
$?srpSrilnvsnkgdrwecqlethnrk^tfkfdldgdnpeeiati^df 

resfvd^vre^ekademlsedvsvepegdq 

SmpSptLltqvvhsagrrfivspvpesrlreskvfpseitdtvaastaqspgmn 
lshsaIslsl^afselr^^ 

apwaSsppSistsviqsevtvpteegiagvatstgwtsgglpippvsespvlssw 

SS^^PAWSISTTSPSLQVPTSTSEIWSSTALYPSVTVSATSASAGGSTATPGPKPPA 

wsqqStvgatltsvstttsfpstasqlsiqlssststptlaeivwsahsldkt 

is/ 113 
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FIGURE IP 

HSSTTGLAFSLSAPSSSSSPGAGVSSYISQPGGLHPLVIPSVIASTPILPQAAGPTSTPL 
LPQVPSIPPLVQPVANVPAVQQTLIHSQPQPALLPNQPHTHCPEVDSDTQPKAPGIDDIK 
TLEEKLRSLFSEHSSSGAQHASVSLETSLVIESTVTPGIPTTAVAPSKLLTSTTSTCLPP 
TNLPLGTVALPVTPWTPGQVSTPVSTTTSGVKPGTAPSKPPLTKAPVLPVGTELPAGTL 
PSEQLPPFPGPSLTQSQQPLEDLDAQLRRTLSPEMITVTSAVGPVSMAAPTAITEAGTQP 
OKGVSOVKEGPVLATSSGAGVFKMGRFQVSVAADGAQKEGKNKSEDAKSVHFESSTSESS 
VLSSSSPESTLVKPEPNGITIPGISSDVPESAHKTTASEAKSDTGQPTKVGRFQVTTTAN 
KVGRFSVSKTEDKITDTKKEGPVASPPFMDLEQAVLPAVIPKKEKPELSEPSHLNGPSSD 
PEAAFLSRDVDDGSGSPHSPHQLSSKSLPSQNLSQSLSNSFNSSYMSSDNESDIEDEDLK 
LELRRLRDKHLKEIQDLQSRQKHEIESLYTKLGKVPPAVIIPPAAPLSGRRRRPTKSKGS 
KSSRSSSLGNKSPQLSGNLSGQSAASVLHPQQTLHPPGNIPESGQNQLLQPLKPSPSSDN 

LYSAFTSDGAISVPSLSAPGQGNKATI IVQKQ 
SEO ID NO- 184 BCON3_H 

MSEGESQTVLSSGSDPKVESSSSAPGLTSVSPPVTSTTSAASPEEEEESEDESEILEESP 

CGRWQKRREEVNQRNVPGIDSAYIJVMDTEEGVEV^EVQFSERKIWKLQEEKVRAVFDN. 

LIOLEHLNIVKFHKYWADIKENKARVIFITEYMSSGSLKQFLKKTKKNHKTl^EKAWKRW 

CTOILSALSYLHSCDPPI IHGNLTCDTIFIQHNGLIKIGSVAPDTINNHVKTCREEQKNL 

HFFAPEYGEVTNVTTAVDIYSFGMCALEMAVLEIQGNGESSYVPQEAISSAIQLLEDPLQ 

REFIOKCLQSEPARRPTARELLFHPALFEVPSLKLLAAHCIVGHQHMIPENALEEITKNM 

DTSAVLAEIPAGPGREPVQTLYSQSPALELDKFLEDVRNGIYPLTAFGLPRPQQPQQEEV 

TSPWPPSVKTPTPEPAEVETRKWLMQCNIESVEEGVKHHLTLLLKLEDKLNRHLSCDL 

MPNENIPELAAELVQLGFISEADQSRLTSLLEETLNKFNFARNSTLNSAAVTVSS 

SEO ID NO- 185 AA711829_M 

LKQFLKKTKKNHKTMNEKAWKRWCTQILSALSYLHSCDPPI IHGNLTCDTIFIQHNGLIK 
IGSVAPDTINNHVKTCREEQKNLHFFAPEYGEVTNVTTAVDIYSFGMCALEMAVLEIQGN 
GESSYVPQEAISSAIQLLEDSLQREFIQKCLQSEPARRPTARELLFHPALFEVPSLKLLA 

AHC I VGHQHMI PENALEE ITKNMDTS AVLAE I PAGPGREPVQTLYSQS PALELDKFLEDV 
RNGIYPLTAFGLPRPQQPQQEEVTSPWPPSVKTPTPEPAEVETRKWLMQCNIESVEEG 
VKHHLTLLLKLEDKLNRHLS CDLMPNE S I PDLAAELVQLGF I SEADQSRLTS LLEETLNK 
FNFTRNSTLNTATVTVS S 

qfo TD NO- 186 AA0991O2 H 

msscvssqpssnraKpqdelggr5ssssesqkpcealrglsslsihlgmesfiwtecep 
gca^lgSrdrpleadgqevpldtsgsqarphlsgrklslqersqgglaaggsldmngr 

ScP^LPYSPVSSPQSSPRLPRRPTVESHHVSITGMQDCVQLNQYTLKDEIGKGSYGWK 
LAYNENDNTYYAMKVLSKKKLIRQAAFPRRPPPRGTRPAPGGCIQPRGPIEQVYQEIAIL 
KKLDHPNWKLVEVLDDPNEDHLYMVFELVNQGPVMEVPTLKPLSEDQARFYFQDLIKGI 

eylhyqkiihrdikpsnllvgedghikiadfgvsnefkgsdallsnyvgtpafmapesls 
etrkifsgkakdvwamgvtlycfvfgqcpfmderimclhskiksqalefpdqpdiaedlk 
dlitrmldknpesrivvpeiklhpwvtrhgaeplpsedenctlvevteeevensvkhips 
latvilvktmirkrsfgnpfegsrreerslsapgnlltkkptreceslselkearqrrqp 
pghrpaprggggsalvrgspcvescwapapgsparmhplrpeeamepe 

cfo TD NO- 187 5R69 17 2 H 

moeipoeoikeikkeqlsgspwYllrenevstlykgeyhrapvaikvfkklqagsiaivr 
q??nSmkkfespnilrifgicidetvtpp 
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FIGURE 1Q 

L 

SEQ ID NO: 1 8 8_H8 5 8 1 *J „„. EPSSNWDMTGYGSHSKVY SQSKNI PLS 

mPVYEGMASHVQVFSPHTLQSSAFCSV^KIEPSSNWDMl GPHN LMRRSTV 

AEYLLSAGTKTTRF FNRDTD S P Y ^^^^ ^ ^^^^p j ETLNHP F VTMTHLLDF PH STH 
AAVAQRSMPLQTGTAQ I ^^^^Q^^pp^QQLTGVATHT SVQHATV I PETMAGTQQ 

ftj^TYISASPASTVYTGYPLSPAKVNQYPYI 




AYEKLEIINyPEIYFVGPNAK^v^.^--- LEHLKXCS)KTG SHNVIHML 
ESFTFRinWC^FE^SIDLYELI^KFQGF^ yRA p EIILGSR YSTPIDIWSF 

SIPLCSVLPKLIS 

SEQ ID NO: WJ*™l 2 *l -I ^SVST QTDGRWLLGGRSRRGKKRGPPG 

TR p ELL GMPPQ^EQS^~ 

SEQ ID NO: l91 - 5R72 -^-^ T OALEAIYG ADFQDLRPDACGPVKEPPEINLVLY 

MAGGRGAPGRGRDE P PE S Y p^^p^^^^gLSNE SVNLLKS RLEELAKKHCGE 
PQGLTGEEVYVKVDLRVKCPP^PDWPEI^ 

^ IFE I A YHVQSFLSEHNKPPPKSFHE^ 

RRKEEIKEEKKRKEMAKQERLEIASLS^ 
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FIGURE 1R 

YLAMNLKEQDDS I WD I LVEH I SGVSLAAHLSHSGPI PVHQLRRYTAQLLSGLDYLHSNS 

WHKVLS ASNVLVDAEGTVKI TDYSIS KRLADI CKEDVFEQTRVRFSDNALP YKTGKKGD 

VWRLGLLLLSLSQGQECGEYPVTIPSDLPADFQDFLKKCVCLDDKERWSPQQLLKHSFIN 

PQPKMPLVEQSPEDSGGQDYVETVIPSNRLPSAAFFSETQRQFSRYFIEFEELQLLGKGA 

FGAVIKVQNKLDGCCYAVKRIPINPASRQFRRIKGEVTLLSRLHHENIVRYYNAWIERHE 

RPAGPGTPPPDSGPLAKDDRAARGQPASDTDGLDSVEAAAPPPILSSSVEWSTSGERSAS 

ARFPATGPGSSDDEDDDEDEHGGVFSQSFLPASDSESDI I FDNEDENSKSQNQDEDCNEK 

NGCHES E PS VTTEAVHYLY I QMEYCEKSTLRDT I DQGL Y RDT VRLWRLFRE I LDGLAY I H 

EKGMIHRDLKPVNIFLDSDDHVKIGDFGLATDHLAFSADSKQDDQTGDLIKSDPSGHLTG 

MVGTALYVSPEVQGSTKSAYNQKVDLFSLGIIFFEMSYHPMVTASERIFVLNQLRDPTSP 

KFPEDFDDGEHAKQKSVISWLLNHDPAKRPTATELLKSELLPPPQMEESELHEVLHHTLT 

NVDGKAYRTMMAQIFSQRISPAIDYTYDSDILKGNFSIRTAKMQQHVCETIIRIFKRHGA 

VQLCTPLLL PRNRQ I Y EHNEAALFMDH SGMLVML PFDLR I PFAR YVARNN I LNLKRY C I E 

RVFRPRKLDRFHPKELLECAFDIVTSTTNSFLPTAEI IYTIYEI IQEFPALQERNYSIYL 

NHTMLLKAI LLHCG I PEDKLSQVY I I LYDAVTEKLTRREVEAKFCNLSLSSNSLCRLYKF 

IEQKGDLQDLMPTINSLIKQKTGIAQLVKYGLKDLEEWGLLKKLGIKLQVLINLGLVYK 

VQQHNGI IFQFVAFIKRRQRAVPEILAAGGRYDLLIPQFRGPQALGPVPTAIGVSIAIDK 

ISAAVLNMEESVTISSCDLLWSVGQMSMSRAINLTQKLWTAGITAEIMYDWSQSQEELQ 

EYCRHHEITYVALVSDKEGSHVKVKSFEKERQTEKRVLETELVDHVLQKLRTKVTDERNG 

REASDNLAVQNLKGSFSNASGLFEIHGATWPIVSVLAPEKLSASTRRRYETQVQTRLQT 

SLANLHQKSSEIEILAVDLPKETILQFLSLEWDADEQAFNTTVKQLLSRLPKQRYLKLVC 

DEIYNIKVEKKVSVLFLYSYRDDYYRILF 
SEQ ID NO: 192_R43524_H, HRI_H 

MLGGNSGVRKREEEGDGAGAVAAPPAIDFPAEGPDPEYDESDVPAEIQVLKEPLQQPTFP 
FAVANQLLLVS LLEHLSHVHE PNPLRSRQVFKLLCQTF I KMGLLS S FTC SDE FS SLRLHH 
NRAITHLMRSAKERVRQDPCEDISRIQKIRSREVALEAQTSRYLNEFEELVILGKGGYGR 

VYKVRNKLDGQYYAI KKI L I KGATKTVCMKVLREVKVLAGLQHPN I VGYHTAW I EHVHV I 
QPRADRAAIELPSLEVLSDQEEDREQCGVKNDESSSSSI IFAEPTPEKEKRFGESDTENQ 
NNKSVKYTTNLVIRESGELESTLELQENGLAGLSASSIVEQQLPLRRNSHLEESFTSTEE 
SSEENVNFLGQTEAQYHLMLHIQMQLCELSLWDWIVERNKRGREYVDESACPYVMANVAT 

KI FQELVEG VF Y I HNMG I VHRDLKPRN I FLHGPDQQVKI GDFGLACTDI LQKNTDWTNRN 
GKRTPTHTSRVGTCLYASPEQLEGSEYDAKSDMYSLGWLLELFQPFGTEMERAEVLTGL 
RTGQLPESLRKRCPVQAKYIQHLTRRNSSQRPSAIQLLQSELFQNSGNVNLTLQMKIIEQ 

EKE I AELKKQLNLLSQDKGVRDDGKDGGVG 
SEQ ID NO: 193_17000057519457_H 

MAAARATTPADGEEPAPEAEALAAARERSSRFLSGLELVKQGAEARVFRGRFQGRAAVIK 
HRFPKGYRHPALEARLGRRRTVQEARALLRCRRAGISAPWFFVDYASNCLYMEEIEGSV 
TVRDYIQSTMETEKTPQGLSNLAKTIGQVLARMHDEDLIHGDLTTSNMLLKPPLEQLNIV 
LIDFGLSFI S ALPEDKG VDL YVLE KAFLS TH PNTETVFEAFLKS Y S TS S KKAR P VLKKLD 
EVRLRGRKRSMVG 

SEQ ID NO: 194_AA013524_M 

LVQQGAEARVFRGRFQGRAAWKHRFPKSYRHPELEARLGRRRTVQEARALLRCRRAGIA 
APWFFVDYASNCLYMEEIEDSVTVRDYIQSTMETEKDPQCLLDLARRMGQVLAGMHDQD 
LI HGDLTTSNMLLRRPLAQLH I VL I DFGLS FVSGLPEDKGVDLY VLEKAFLSTH PHTETA 
FEAFLKSYGASSKKSSPVLKKLDEVRLRGRKRSMVG 
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FIGURE IS 

SEO ID NO: 195 17000139801197_H, IRAKM_H 

MAGNCGARGALSAHTLLFDLPPALLGELCAVLDSCDGALGWRGLAERLSSSWLDVRHIEK 

^grcStrISrpsmdevlntlestqaslyfaedpptslksfrcpsplflenvpsipve 
d^esqn^nSpsdeglridrmtqktpfecsqsevmflsldkkpeskrneeacnmpsssce 
eswf^vpsqdlrpykvnidpsseapghscrsrpvesscsskfswdeyeqykke 

<5EO ID NO: 196 AA840598 M IRAKM M mmr , T 

mw^fLelevlllfrhphilei^^ 

h^svligiakaiqylhntqpcavicgnvssanillddqlqpkltdfaaahfrpnleqq 

sSSgggrkhlSympeeyirqgrlsvktdvysfgivimevltgckvvl^ 
dllmelmekrgldsclsfldrkippcprnfsaklfslagrcvatkaklrptmdevlssle 

stopslyfaW^ 

?p?ecsqsevtflgldrnrgnrgseadcnvpsssheecwspelvapsqdlspivislgss 



wevpghsygskpmekrcssglfcseheqskkq 

ci?r> m NO- 197 AA088547 H 

mIsavrgsrpwprlglqlqfaalIlgtlspqvhtlrpenlllvstldgslhalskqtgdl 



tyrrysappmdgspgkymshlascgmgllltvdpgsgtvlwtqdlgvpvmgvytwhqdgl 

rolphS^S^lhfi^lrwghirlpasgprdtatlfstldtqllmtlyvgk^ 
Stg^lvp^ltlapadgpttdevtlqvsgeregspstavrypsgsvalpsqwlli 

S^™SvHPT1X3SGTAETRPPENTOAPAFFI^LLSLSREKLWDSEI^PEEK 
T PDS YLGLG PQDLLAAS LTAVLLGG W I LF VMRQ WEKQQET P LA PAD F AH I SQDAQ ^ S 

?asrrso^lqspskqaqplddpeaeqltvvgkisfnpkdvlgrgaggtfvfrgqfegra 
Sa™l2Sc?g[wLvqllqesdrhpnvlryfctergpqfhyialelcr^ 

Sgglepe^qqlmsg^ 

kklpagrcsfslhsgipgtegwmapellqllppdsptsavdifsagcvfyyvlsggshpf 
gSyrqani^ 

akoloffotvsdwlekeseqeplvraleaggcawrdnwheh 
lflpyyppdsearrpcpgatgr 

qpn TD NO- 198 HGP 6644466 

MEGISNFKTPSKLSEKKKSVLCSTPTINIPASPFMQKLGFGTGVNVYLMKRSPRG 
WAVKKINPICNDHYRSVYQKRLMDEAKILKSLHHPNIVGYRAFTEANDGSLCL^ 

TI Kit CDVGVS LPLDENMTVTDPEACY I GTE PWKPKEAVEENGV I TDKAD I FAFGLTLWEM 

mtS^mnlsnddddedktfdesdfddeayyaalgtrppinmeeldesyqkvielfsvc 

TNEDPKDRPSAAHIVEALETDV 
c;pn ID NO: 199 AA449542 M 

sprglshspwavkkTsllcddhyrtvyqkrltdeakilknlnhpni IGYRAFTEASDGSL 
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FIGURE IT 

NWI KGDFET I KI CDVGVSLPLDENMTVTDPEACY I GTEPWKPKEALEENG 1 1 TDKADVF 
AFGLTLWEMMTLCI PHVNLPDDDVDEDATFDESDFDDEAYYAALGTRPS INMELDDSYQK 

AIELFCVCTNEDPKDRPSAAHIVEALELDGQCCGLSESKH 

SEQ ID NO: 200_5R57_10_2_M TESK2_M 
LLDSDLYLPWTVRVKLAYGIAVGLSYLHFKGIFHRDLTSKV 

SEQ ID NO: 201_AA232253_H 

MSSLGASFVQIKFDDLQFFENCGGGSFGSVYRAKWISQDKEVAVKKLLKIEKEAEILSVL 
SHRNIIQFYGVILEPPNYGIVTEYASLGSLYDYINSNRSEEMDMDHIMTWATDVAKGMHY 
LHMEAPVKVIHRDLKSRNWIAADGVLKICDFGASRFHNHTTHMSLVGTFPWMAPEVIQS 
LPVSETCDTYSYGWLWEMLTREVPFKGLEGLQVAWLWEKNERLTIPSSCPRSFAELLH 
OCWEADAKKRPSFKQIISILESMSNDTSLPDKCNSFLHNKAEWRCEIEATLERLKKLERD 
LSFKEQELKERERRLKMWEQKLTEQSNTPLLPSFEIGAWTEDDVYCWVQQLVRKGDSSAE 
MSVYASLFKENNITGKRLLLLEEEDLKDMGIVSKGHIIHFKSAIEKLTHDYINLFHFPPL 
I KDSGGEPEENEEKI VNLELVFGFHLKPGTGPQDCKWKMYMEMDGDE I A I T Y I KDVTFNT 
NLPDAEILKMTKPPFVMEKWIVGIAKSQTVECTVTYESDVRTPKSTKHVHLIQWSRTKPQ 
DEVKAVQLAIQTLFTNSDGNPGSRSDSSADCQWLDTLRMRQIASNTSLQRSQSNPILGSP 
FFSHFDGQDSYAAAVRRPQVPIKYQQITPVNQSRSSSPTQYGLTKNFSSLHLNSRDSGFS 
SGNTDTSSERGRYSDRSRNKYGRGSISLNSSPRGRYSGKSQHSTPSRGRYPGKFYRVSQS 
ALNPHQSPDFKRSPRDLHQPNTIPGMPLHPETDSRASEEDSKVSEGGWTKVEYRKKPHRP 

S PAKTNKERARGDHRG WRNF 

SEQ ID NO: 202_AI375137_H 

MGNYKSRPTQTCTDEWKKKVSESYVITIERLEDDLQIKEKELTELRNIFGSDEAFSKVNL 
NYRTENGLSLLHLCC I CGGKKSHI RTLMLKGLRPSRLTRNGFTALHLAVYKDNAELI TSL 
LHSGADIQQVGYGGLTALHIATIAGHLEAADVLLQHGANWIQDAVFFTPLHIAAYYGHE 
OVTRLLLKFGADVNVSGEVGDRPLHLASAKGFLNIAKLLMEEGSKADVNAQDNEDHVPLH 
FCSRFGHHDI VKYLLQSDLEVQPHWNI YGDTPLHLACYNGKFEVAKE 1 1 Q I SGTE SLTK 

EN I FS ETAFHS ACTYGKS I DLVKFLLDQNV INI NHQGRDGHTGLHS AC YHGH I RLVQFLL 
DNGADMNLVACDPSRSSGEKDEQTCLMWAYEKGHDAIVTLLKHYKRPQDELPCNEYSQPG 
GDGSYVSVPSPLGKIKSMTKEKADILLLRAGLPSHFHLQLSEIEFHEI IGSGSFGKVYKG 
RCRNKIVAIKRYRANTYCSKSDVDMFCREVSILCQLNHPCVIQFVGACLNDPSQFAIVTQ 

YI SGGSLFSLLHEQKRI LDLQSKLI IAVDVAKGMEYLHNLTQPI IHRDLNSHNILLYEDG 
HAWADFGESRFLQSLDEDNMTKQPGNLRWMAPEVFTQCTRYTIKADVFSYALCLWEILT 

GE I PFAHLKPAAAAADMAYHH I RPP I GY S I PKP I S SLL I RGWNAC PEGRPEFS EWMKLE 
ECLCNIELMSPASSNSSGSLSPSSSSDCLVNRGGPGRSHVAALRSRFELEYALNARSYAA 

LSQSAGQYSSQGLSLEEMKRSLQYTPIDKYGYVSDPMSSMHFHSCRNSSSFEDSS 
SEO ID NO: 203 H97685_H 

MESERSPLYRQLIDLGYLSSSHWNCGAPGQDTKAQSMLVEQSEKLRHLSTFSHQVLQTRL 
VDAAKALNLVHCHCLD I F INQAFDMQRDLQ I TPKRLE YTRKKENELYE SLMN I ANRKQEE 
MKDMI VETLNTMKEELLDDATNMEFKDV I VPENGEPVGTRE I KCC I RQI QELI I S RLNQA 
VANKLISSVDYLRESFVGTLERCLQSLEKSQDVSVHITSNYLKQILNAAYHVEVTFHSGS 

SVTRMLWEQI KQI I QRI TWVSPPAI TLEWKRKVAQEAI ESLSASKLAKS I CSQFRTRLNS 
SHEAFAASLRQLEAGHSGRLEKTEDLWLRVRKDHAPRLARLSLESRSLQDVLLHRKPKLG 
OELGRGQYGWYLCDNWGGHFPCALKSWPPDEKHWNDLALEFHYMRSLPKHERLVDLHG 

S V I DYNYGGGS S I AVLL I MERLHRDLYTGLKAGLTLETRLQ I ALDWEG I RFLHSQGLVH 
RDIKLKNVLLDKQNRAKITDLGFCKPEAMMSGSIVGTPIHMAPELFTGKYDNSVDVYAFG 
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FIGURE 1U 

ILFWYICSGSVKLPEAFERCASKDHLWNWVRRGARPERLPVFDEECWQLMEACWDGDPLK 

rpIlgivqpmlqgimnrlcksnseqpnrglddst 




SEQ ID NO: 205_AA744236 H ASVFVYKEEN EDKVNKAAKHI,KTL 

ieayvehftqeqlkkvilpqvllgl^ 
fStapsftkntdlslegdp^ 

PEEPENQTVNIQIWPREPCDDVKSQCTTLDVEESSWD gppKVS SQERPLKVPSELG 
SSSJ^SSSS ^^SEfKPsS-XEMVPK^VSPVMOFSS 
KFAAAEITEGEAEGWEEEGELNWEDNNW 

SEQ ID NO: 206_AI052250 jj| HIASGCNGLA WKIFNGTKKSTKQE 




epiqilliflqkmdlllt: 

YPSMKNALIPRIKNACYKHLPLRFV 
LEQYDPPEIJttSSGRVVREKW^ 

CELVGANPKVRPNPARFLQNCR^PGGFMSNRFVE^ 

TDRAMR I RLLQQMEQF I pTY^S^iy q^qjq q g YL S ASTRHRVLT S AF SRATRDP F 

ERRREMEAKRAERKVAKGPMKLGARKLD 
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FIGURE IV 

SEQ ID NO: 2 08_AA5 99286 H gvENSWQlVRRYSDFD 

MAFMEKPPAGKVLU^ 
LLNNSLQIAGLSLPLPPKKLIG^DRE^ 

PNNYSANYTEIALQQVSMFFRSEPKWEWEPL^IG^ 

ADLGPDKYLSDKDFQCLIKLLPSCLHPYIYRVTFATA^Eb^ 

^PKDPFLKKYCNPKKIQGLELQQIKTYGRQ^ 

CRLLDLENSLLGLPSFYRSYFSQFRKI^ 
PAPSMAWAVLESTLSCEACKNGMPTISRLLQMP^ 

Lkeciekrlieeqkqihqhrrltraqshhgseeer^ 

KYSNSNNSAGSGASSPLTSPSSPTP^ 

SEQ ID NO: 2 1 0_SGK0 2 2 H c KKHORKVAI KV I DKMGGP SEF I QRFLPRELQ 

MEDFLLSNGYQLGKTIGEGTYSKVK^FSK^^ 

IVRTLDHKNI IQVYEMLESADGKI ^^XpKSHRELSQTFCGSTAYAAPEV 
LLKRLLEPDMILRPSIEEVSWHPWLAST 

SEQ ID NO: 211_AA060026_M SGK022 M GGPEEFIQ RFLPRELQ 

mSfllsngyqlgktigegt™ 

IVRTLDHKNI IQVYEMLESADGKI ^^f^p^pKSRRELSQTFCGSTAYAAPEV 

™= 

LLKRLLEPDMILRPSIEEVSWHPWLAST 




mddatvlrkkgyivg^lgkgsyak^ 



VI') 
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FIGURE 1W 

LSILRGVRHPHIVHVFEFIEVCNG^YIWE^TD^gA^ TycGSAAYAS P 

— — — — ^ 

SISfspsarpsagqvrrkcwlhagdsg 
Ss™Sllepd M ilrps I e E vs«hp«last 

SEO ID NO, 21 6 - H2997 1-" , 0 ,,, VKKIRCDA pENVEUiIAEFWALTSLKRRHQNV 
VQFEECVLQRNGUVQF^SHGNKSSQLYL^ 

D^QYVI^I^PDPRT^SFI^TSMAFLH^r™^ ^pEVWEGHYTAKADIFALG 
IKQLLKDMIiAANPQDRPDAFELETRMDQVTCAA 

SEQ ID NO: "V* 49 ""^" SraRSCTQRVLCEKLRPAAQAMDPAGAEVPGEA 
PLLLPPPPAAMETGKENGMRGT^PER^RSPWRVD RVAVKKIRCDAPE HVE 
FUJJRRPDGGGGDVPARPRVSL^EIGRGSYGVVI ^sqlylrlvETSLKGERIL 

SEQ ID NO: 219 -^ 01 *^" rTAARSE EAPARAQAAGMAGPGWGPPRLDGFILT 

MRAAFP AGG AGGS VE P P ^^^^?^^^^^g lnKAS VENLLTE I E I LKG I RH PH I VQL 
ERLGSGTYATVYKAYAK^TREWAIKC^K^ 

KDFQWDSDNIYLlMEFCAGGDLSRFIHTRRIbF yMA p EM VCQRQYDARVDLW 
LKPQNILLSSLEKPHLKLADFGFAQHMSPWD^ 

a3/ll} 
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FIGURE IX 

LHYEVDAQRKEAIKAKVGQYVSRAEELKAIVSSSNQALLRQGTSARDLLREMARDKPRLL 
AALEVASAAMAKEEAAGGEQDALDLYQHSLGELLLLLRSPRAGGGSCFTLRFRTSWPELN 



SEO ID NO: 220 AA311714_H 

MENF I LY EE I GRGS KTWYKGRRKGT I NFVA I LCTDKCRRPE I TNWVRLTRE I KHKN I VT 
FHEW YETSNHLWLVXENLPEDWREFG I DL I SGLHHLHKLG I LFCD I S PRKI LLEG PGTL 
KFSNFCLAKVEGENLEEFFALVAAEEGGGDNGENVLKKSMKSRVKGSPVYTAPEWRGAD 

FSISSDLWSLGCLLYEMFSGKPPFFSESVSELTEKILCEDPLPPIPKDSSRPKASSDFIN 
LLDGLLQRDPQKRLTWTRLLQHSFWKKAFAGADQESSVEDLSLSRNTMECSGPQDSKELL 
QNSQSRQAKGHKSGQPLGHSFRLENPTEFRPKSTLEGQLNESMFLLSSRPTPRTSTAVEV 

S PGEDMTHCS PQKTS PLTKI TSGHLSQQDLE SQMREL I YTDSDLWTP 1 1 DNPKI MKQPP 
VKFDAKILHLPTYSVDKLLFLKDQDWNDFLQOVCSQIDSTEKSMGASRAKLNLLCYLCW 

AGHQEVATRLLHSPLFQLLI QHLRI APNWDI RAKVAHVIGLLASHTTELQENTPWETTS 

SIGIGILNCLVQHSTPVPRQCLVYV 

SEQ ID NO: 221_SGK384_H 

SLAHVLRARQI LTEPEVRDYLRGLVSGLRYLHQRCI LHR 
SEO ID NO- 222 AA210451 M SGK384_M 

MGQQHGTRNGLTHRiLPRGVGLLI^^ALYLCLDQLFISPGRSTADSRRCPPGYFR 
MGRMRNCSRWLSCEELRTEVRQLKRVGEGAVKRVFLSEWKEHKVALSRLTRLEMKEDFLH 
GLOMLKSLQSEHWTLVGYCEEDGTILTEYHPLGSLSNLEETLNLSKYQDVNTWQHRLQL 
AMEYVSIINYLHHSPLGTRVMCDSNDLPKTLSQYLLTSNFSIVANDLDALPLVDHDSGVL 
IKCGHRELHGDFVAPEQLWPYGEDTPFQDDLMPSYNEKVDIWKIPDVSSFLLGHVEGSDM 
VRFHLFDIHKACKSQIPAERPTAQNVLDAYQRVFHSLRDTVMSQTKEML 

SEO ID NO: 223 SGK071_2_H 

EWAVQMMVECMDDiYASQALEELMPLLKLRHAHISWQELFITWNGEISSLYLCLVMEF 
NELSFQEVIEDKRKAKKIIDSEWMQNVLGQVLDALEYLHHLDI IHRNLKPSNIILISSDH 
CKLODLSSNVLMTDKAKWNIRAEEDPFRKSWMAPEALNFSFSQKSDIWSLGCIILDMTSC 

SFMDGTEAMHLRKSLRQSPGSLKAVLKTMEEKQIPDVETFRNLLPLM 
WHITFLRGSFKSSCVSLTLHRQMVPASITDMLLEGNVASILGDAGDTKGERALKLLSMA 
LASYCLVPEGSLFMPLALLHMHDQWLSCDQDRVPGKRDFASLGKLGKLLGPIPKGLPWPP 
ELVEWVTTMELHDRVLDVQLCACSLLLHLLGQALVHHPEAKAPCNQAITSTLLSALQSH 
PEEEPLLVMVYSLIAITTTQESESLSEELQNAGLLEHILEHLNSSLESRDVCASGLGLLW 
ALLLDDPILALQRPRKKRAPNHGKPGKPKNPASTQSI IVNKAPLEKVPDLI SQVLATYPA 
DGEMAEASCGVFWLLSLLGCI KEQQFEQWALLLQS IRLCQDRALLVNNAYRGLASLVKV 
SELAAFKWVQEEGGSGLSLIKETYQLHRDDPEWENVGMLLVHLASYEEILPELVSSSM 

KALLQEIKERFTSSLVSDSSAFSKPGLPPGGSPQLGCTTSGGLE 
SFO ID NO- 224 AA118352 M SGK071_M 

eedpcqkswmapeaEkfsfstksBiwslgciildmatcsflndteamqlrkairhhpgsl 

KPILKTMEEKQIPGTDVYYLLLPFMLHINPSDRLAIKDVMQVTFMSNSFKSSSVALNMQR 

oSpifitdvllegnmanilgswlcasfvndsrhcdsgigsqrlgfdfqsvswtehplkd 
^nfssrpevqlrainklltmpedqlglpwptelleevisiikqhgrildillstcsll 
Slgqalakdpeaeiprssli i sflmdtlrshpnserlvnwynvlai I ssqgqi seel 
eeeglfqiaqenlehfqedrdiclsilsllwsllvdwtvdkepl^ 

edveIaeagcavlwl^ 



ml I/) 
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FIGURE 1Y 

VSELVAFRIWLEEGSSGLHLIQDIYKLYKDDPEWENLCMLLAHLTSYKEILPEMESGG 
IKDLVQVIRGRFTSSLELISYADEILQVLEANAQPGLQEDQLEPPAGQEAPLQGEPLFRP 

SEQ ID NO: 225_018653 . 9_H 

GRGRGAGHARGLGRGPAGRRAEPPRSLSRPGPGPGSRAGPAGRGEGSDAAPAGGSGRGFL 
RLLPAGLRPQRALRSGSEPPRPGQSPEPSPAPGAGRRGGRGELARQIRARYEEVQRYSRG 
GPGPGAGRPERRRLMDLAPGGPGLPRPRPPWARPLSDGAPGWPPAPGPGSPGPGPRLGCA 
ALRNVSGAQYMGSGYTKAVYRVRLPGGAAVALKAVDFSGHDLGSCVREFGVRRGCYRLAA 
HKLLKEMVLLERLRHPNVLQLYGYCYQDSEDIPDTLTTITELGAPVEMIQLLQTSWEDRF 
RICLSLGRLLHHLAHSPLGSVTLLDFRPRQFVLVDGELKVTDLDDARVEETPCAGSTDCI 
LEFPARNFTLPCSAQGWCEGMNEKRNLYNAYRFFFTYLLPHSAPPSLRPLLDSIVNATGE 
LAWGVDETLAQLEKVLHLYRSGQYLQNSTASSSTEYQCIPDSTIPQEDYRCWPSYHHGSC 
LLSVFNLAEAVDVCESHAQCRAFWTNQTTWTGRQLVFFKTGWSQWPDPNKTTYVKASG 

SEQ ID NO: 226_AA3 96601_M 

TRPGCAALRNVSGAQYVGSGYTKAVYRVRLPGGAAVALKAVDFSGHDLGSCVREFGARRG 

CYRLAAHKLLKEMVLLERLRH PNVLQLYGYCYQDS EG I PDTLTTI TELGAPVEM I QLLQT 
SWEDRFRICLSLGRLLHHLAHSPLGSVTLLDFRPRQFVLVNGELKVTDLDDARVEETPCT 
SSADCTLEFPARNFSLPCSAQGWCEGMNEKRNLYNAYRFFFTYLLPHSAPPSLRPLLDSI 
VNATGELAWGVDETLAQLETALHLFRSGQYLQNSTSSRAEYQRIPDSAITQEDYRCWPSY 
HHGGCLLSVFNLAEAIDVCESHAQCRAFWTNQTTWTGRKLVFFKTGWNQWPDAGKTTY 

VKAPG 

SEQ ID NO: 227_VRK3_H 

MISFCPDCGKSIQAAFKFCPYCGNSLPVEEHVGSQTFVNPHVSSFQGSKRGLNSSFETSP 
KKVKWSSTVTSPRLSLFSDGDSSESEDTLSSSERSKGSGSRPPTPKSSPQKTRKSPQVTR 
GSPQKTSCSPQKTRQSPQTLKRSRVTTSLEALPTGTVLTDKSGRQWKLKSFQTRDNQGIL 
YEAAPTSTLTCDSGPQKQKFSLKLDAKDGRLFNEQNFFQRAAKPLQVNKWKKLYSTPLLA 
IPTCMGFGVHQDKYRFLVLPSLGRSLQSALDVSPKHVLSERSVLQVACRLLDALEFLHEN 
EYVHGNVTAENIFVDPEDQSQVTLAGYGFAFRYCPSGKHVAYVEGSRSPHEGDLEFISMD 
LHKGCGPSRRSDLQSLGYCMLKWLYGFLPWTNCLPNTEDIMKQKQKFVDKPGPFVGPCGH 
WIRPSETLQKYLKWMALTYEEKPPYAMLRNNLEALLQDLRVSPYDPIGLPMVP 

SEQ ID NO: 228_S71575_M VRK3_M 

IPTCIGFGIHQDKYRFLVFPSLGRSLQSALDDNPKHWSERCVLQVACRLLDALEYLHEN 
EYVHGNLTAENVFVNPEDLSQVTLVGYGFTYRYCPGGKHVAYKEGSRSPHDGDLEFISMD 
LHKGCGPSRRSDLQTLGYCMLKWLYGSLPWTNCLPNTEKITRQKQKYLDSPERLVGLCGR 
WNKASETLREYLKWMALNYEEKPPYATLRNSLEALLQDMRVSPYDPLDLQMVP 

SEQ ID NO: 229_AA4 5427_H 

MGHALCVCSRGTVIIDNKRYLFIQKLGEGGFSYVDLVEGLHDGHFYALKRILCHEQQDRE 
EAQREADMHRLFNHPNILRLVAYCLRERGAKHEAWLLLPFFKRGTLWNEIERLKDKGNFL 

TEDQ I LWLLLG I CRGLEA I HAKG YAHRDLKPTN I LLGDEGQP VLMDLGS MNQAC I HVEG S 
RQALTLQDWAAQRCTISYRAPELFSVQSHCVIDERTDVWSLGCVLYAMMFGEGPYDMVFQ 
KGDSVALAVQNQLSIPQSPRHSSALRQLLNSMMTVDPHQRPHIPLLLSQLEALQPPAPGQ 

HTTQI 

SEQ ID NO: 230_H05721_H 

MAVRQALGRGLQLGRALLLRFTGKPGRAYGLGRPGPAAGCVRGERPGWAAGPGAEPRRVG 
LGLPNRLRFFRQSVAGLAARLQRQFWRAWGCAGPCGRAVFLAFGLGLGLIEEKQAESRR 
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FIGURE 1Z 

AVSACQEIQAIFTQKSKPGPDPLDTRRLQGFRLEEYLIGQSIGKGCSAAVYEATMPTLPQ 
NLEVTKSTGLLPGRGPGTSAPGEGQERAPGAPAFPLAIKMMWNISAGSSSEAILNTMSQE 
LVPASRVALAGEYGAVTYRKSKRGPKQLAPHPNIIRVLRAFTSSVPLLPGALVDYPDVLP 
SRLH PEGLGHGRTLFLVMKNY PCTLRQYLCVNTPS PRLAAMMLLQLLEGVDHLVQQG I AH 
RDLKSDNILVELDPDGCPWLVIADFGCCLADESIGLQLPFSSWYVDRGGNGCLMAPEVST 
ARPGPRAVIDYSKADAWAVGAIAYEIFGLVNPFYGQGKAHLESRSYQEAQLPALPESVPP 
DVRQLVRALLQREASKRPSARVAANVLHLSLWGEH I LALKNLKLDKMVGWLLQQSAATLL 
ANRLTEKCCVETKMKMLFLANLECETLCQAALLLCSWRAAL 

SEQ ID NO: 231_AI086865_H 

MEKYERIRWGRGAFGIVHLCLRKADQKLVIIKQIPVEQMTKEERQAAQNECQVLKLLNH 
PNVIEYYENFLEDKALMIAMEYAPGGTLAEFIQKRCNSLLEEETILHFFVQILLALHHVH 
THLI LHRDLKTQN I LLDKHRMWKI GDFG I SKI LS SKSTPCYI SPELCEGKPYNQKSDI W 
ALGCVLYELASLKRAFEAANLPALVLKI MSGTFAP I SDRY S PELRQLVLS LLS LEPAQRP 
PLSHI MAQPLC I RALLNLHTDGREVRGPQQHREQDHQCPLQRGI I MTFGSGSNGCLGHGS 
LTDISQPTIVEALLGYEMVQQVEEALSFTLLGSAPLDQEPLLSIDLGTAHSAAVTGEEDL 
GSGDVNRLPSWERGHLLAGVASSTDVSTFSEGDCKEPDKCCWRHKQCTGHI I YPFASDCV 
RHSLHLHSVNHCNCNSRLKDSSEDSSSSRGAGPTCSHVIESPCFELTPEEEHVERFRYGW 
CKSYRPVSVAVIHHPLYHECGADDLNXKKRKRRRRKSKPPIPTQVGPATASPDLGTSMAT 
GTPDSTAPITIWRSESPTGKGQGSKVIKKVKKKKEKEKDKEEMDEKAKLKKKAKKGQLTK 
KKSPVKLEPSPPDVSRSLSARQLARMSESSPESREELESEDSYNGRGQGELSSEDIVESS 
S PRKRENTVQAKKTGAKP SQARKVNKRKS P PGSNPNLS 

SEQ ID NO: 232_AA836348_H 

MSVLGEYERHCDSINSDFGSESGGCGDSSPGPSASQGPRAGGGAAEQEELHYIPIRVLGR 
GAFGEATL YRRTEDD S L WWKEVDLTRLS EKERRDALNE I V I LALLQHDN 1 1 A Y YNH FMD 
NTTLLIELEYCNGGNLYDKILRQKDKLFEEEMWWYLFQIVSAVSCIHKAGILHRDIKTL 
NIFLTKANLIKLGDYGLAKKLNSEYSI^IAETLVGTPYYMSPELCQGVKYNFKSDIWAVGCV 
IFELLTLKRTFDATNPLNLCVKIVQGIRAMEVDSSQYSLELIQMVHSCLDQDPEQRPTAD 
ELLDRPLLRKRRRSSTVTEAPIAWTSRTSEVYVWGGGKSTPQKLDVIKSGCSARQVCAG 
NTHFAWTVEKELYTVA7NMQGGTKLHGQLGHGDKASYRQPKHVEKLQGKAIRQVSCGDDF 
TVCVTDEGQLYAFGSDYYGCMGVDKVAGPEVLEPMQLNFFLSNPVEQVSCGDNHVWLTR 
NKEVYSWGCGEYGRLGLDSEEDYYTPQKVDVPKALIIVAVQCGCDGTFLLTQSGKVLACG 
LNEFNKLGLNQCMSGI INHEAYHEVPYTTSFTLAKQLSFYKIRTIAPGKTHTAAIDERGR 
LLTFGCNKCGQLGVGNYKKRLGINLLGGPLGGKQVIRVSCGDEFTIAATDEKVLNSKTIR 
SNS SGLS IGTVFQSS S PGGGGGGGGGEEEDSQQESETPDPSGGFRGTMEADRGMEGLI S P 
TEAMGNSNGASSSCPGWLRKELENAEFIPMPDSPSPLSAAFSESEKDTLPYEELQGLKVA 
SEAPLEHKPQVEASVTELFAFESQLVTSAESCSNLCWEGNTTDSSCVCVQLSAGGG 

SEQ ID NO: 23 3_R86668_H, MKK6_H 

MNLLLSYRDVQDYSAI I ELVETLQALPTCDVAEQHNVCFHYTFALNRRNRPGDRAKALSV 
LLPLVQLEGS VAPDLYCMCGR I Y KDMFFS SGFQDAGHREQAYHWYRKAFDVE P S LH SG I N 
AAVLLIAAGQHFEDSKELRLIGMKLGCLLARKGCVEKMQYYWDVGFYLGAQILANDPTQV 
VLAAEQLYKLNAPIWYLVSVMETFLLYQHFRPTPEPPGGPPRRAHFWLHFLLQSCQPFKT 
ACAQGDQCLVLVLEMNKVLLPAKLEVRGTDPVSTVTLSLLEPETQDIPSSWTFPVASICG 
VSASKRDERCCFLYALPPAQDVQLCFPSVGHCQWFCGLIQAWVTNPDSTAPAEEAEGAGE 
MLEFDYEYTETGERLVLGKGTYGWYAGRDRHTRVRIAI KEI PERDSRFSQPLHEE I ALH 
RRLRHKNI VRYLGSASQGGYLKI FMEEVPGGSLS SLLRS VWG PLKDNEST I S FYTRQI LQ 
GLGYLHDNHIVHRDIKGDNVLINTFSGLLKISDFGTSKRLAGITPCTETFTGTLQYMAPE 
IIDQGPRGYGKAADIWSLGCTVIEMATGRPPFHELGSPQAAMFQVGMYKVHPPMPSSLSA 
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FIGURE 1AA 

EAQAFLLRTFEPDPRLRASAQTLLGDPFLQPGKRSRSPSSPRHAPRPSDAPSASPTPSAN 
STTQSQTFPCPQAPSQHPPSPPKRCLSYGGTSQLRVPEEPAAEEPASPEESSGLSLLHOE 
SKRRAMLAAVLEQELPALAENLHQEQKQEQGARLGRNHVEELLRCLGAHIHTPNRRQLAQ 

ELRALQGRLRAQGLG PALLHRPL F AF PDAVKQ I LRKRQI RPHWMFVLDSLLSRAVRAALG 
VLGPEVEKEAVSPRSEELSNEGDSQQSPGQQSPLPVEPEQGPAPLMVQLSLLRAETDRLR 
EILAGKEREYQALVQRALQRLNEEARTYVLAPEPPTALSTDQGLVQWLQELNVDSGTIQM 
LLNHSFTLHTLLTYATRDDLIYTRIRGGMVCRIWRAILAQRAGSTPVTSGP 

SEQ ID NO: 234_PAK6_H 

MFGKKKKKIEISGPSNFEHRVHTGFDPQEQKFTGLPQQWHSLLADTANRPKPMVDPSCIT 
PIQLAPMKTIVRGNKPCKETSINGLLEDFDNISVTRSNSLRKESPPTPDQGASSHGPGHA 
EENGFITFSQYSSESDTTADYTTEKYREKSLYGDDLDPYYRGSHAAKQNGHVMKMKHGEA 
YYSEVKPLKSDFARFSADYHSHLDSLSKPSEYSDLKWEYQRASSSSPLDYSFQFTPSRTA 
GTSGCSKESLAYSESEWGPSLDDYDRRPKSSYLNQTSPQPTMRQRSRSGSGLQEPMMPFG 
ASAFKTHPQGHSYNSYTYPRLSEPTMCIPKVDYDRAQMVLSPPLSGSDTYPRGPAKLPQS 
QSKSGYSSSSHQYPSGYHKATLYHHPSLQSSSQYISTASYLSSLSLSSSTYPPPSWGSSS 
DQQPSRVSHEQFRAALQLWSPGDPREYLANFIKIGEGSTGIVCIATEKHTGKQVAVKKM 
DLRKQQRRELLFNEWIMRDYHHDNVVDMYSSYLVGDELWVVMEFLEGGALTDIVTHTRM 
NEEQIATVCLSVLRALSYLHNQGVIHRDIKSDSILLTSDGRIKLSDFGFCAQVSKEVPKR 
KSLVGTPYWMAPEVISRLPYGTEVDIWSLGIMVIEMIDGEPPYFNEPPLQAMRRIRDSLP 
PRVKDLHKVSSVLRGFLDLMLVREPSQRATAQELLGHPFLKLAGPPSCIVPLMRQYRHH 

SEQ ID NO: 235_SURTK106_H 

MNDRNEIQMEAKLQSLTI IAQEILCRFFITLRRHARFLLTKLGRQGMARSGITHSCAVCI 
LCGPSREGDSPVAMGMTRMLLECSLSDKLCVIQEKQYEVI IVPTLLVTIFLILLGVILWL 
FIREQRTQQQRSGPQGIAPVPPPRDLSWEAGHGGNVALPLKETSVENFLGATTPALAKLQ 
VPREQLSEVLEQICSGSCGPIFRANMNTGDPSKPKSVILKALKEPAGLHEVQDFLGRIQF 
HQYLGKHKNLVQLEGCCTEKLPLYMVLEDVAQGDLLGFLWTCRRDVMTMDGLLYDLTEKQ 
VYHIGKQVLLALEFLQEKHLFHGDVAARNILMQSDLTAKLCGLGLAYEVYTRGAISSTQT 

I PLKWLAPERLLLRPAS I RADWJS FGI LLYEMVTLGAPPYPEVPPTS I LEHLQRRKIMKR 
PSSCTHTMYSIMKSCWRWREADRPSPRELRLRLEAAIKTADDEAVLQVPELWPELYAAV 

AGIRVESLFYNYSML 

SEQ ID NO: 236_AA098024_M 

LQEKHLFHGDVAARNILIQSDLTPKLCHLGLAYEVHAHGAISSARSSTIPLKWLAPERLL 
LRPASIRGDIWSFGI LLYEMVTLGAPPYPEVPPTS I LQYLQRKKIMKRPSSCSHAMYNIM 
KCC WRW S ED S R PLL VQLLQRLE AAS RS ADDKAVLQVPELW PEL YAD VAG I RAE S I SYSF 
SVL 

SEQ ID NO: 237_SGK2ALPHA_H 

MNSSPAGTPSPQPSRANGNINLGPSANPNAQPTDFDFLKVIGKGNYGKVLLAKRKSDGAF 
YAVKVLQKKS I LKKKEQSH I MAERS VLLKNVRHPFLVGLRYS FQTPEKLYFVLDYVNGGE 
LFFHLQRERRFLEPRARFYAAEVASAIGYLHSLNIIYRDLKPENILLDCQGHWLTDFGL 
CKEGVEPEDTTSTFCGTPEYLAPEVLRKEPYDRAVDWWCLGAVLYEMLHGLPPFYSQDVS 
QMYENILHQPLQIPGGRTVAACDLLQSLLHKDQRQRLGSKADFLEIKNHVFFSPINWDDL 
YHKRLTPPFNPNVTGPADLKHFDPEFTQEAVSKSIGCTPDTVASSSGASSAFLGFSYAPE 

DDDILDC 



PCT/USOO/14842 

WO 00/73469 

FIGURE IBB 

AASQGLHMQNDDACLGAASPECCLLVKEKCRE 

SEQ ID NO: 23 9.TESK2 H GRVWpssYRAL I SAFSRLTRLDDFT 

MDRSKRNSIAGFPPRVERLEEFEGGGGGEG^SQV^ LMNRLS HPNILRYINSG 

CBKIGSGFFSEVFKVWO^^ 

nleqlldsnlhlpwiwvkiaydiavglsylh™ 

STSGIGLQTQGKQDG 
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FIGURE 2A 

SEQ ID NO: 1_X69117_H BARK2_H 

ATGGCGGACCTGGAGGCCGTGCTGGCCGATGTCAGTTACCTGATGGCCATGGAGAAGAGC 

AAGGCGACCCCGGCCGCCCGCGCCAGCAAGAGGATCGTCCTGCCGGAGCCCAGTATCCGG 

AGTGTGATGCAGAAGTACCTTGCAGAGAGAAATGAAATAACCTTTGACAAGATTTTCAAT 

CAGAAAATTGGTTTCTTGCTATTTAAAGATTTTTGTTTGAATGAAATTAATGAAGCTGTA 

CCTCAGGTGAAGTTTTATGAAGAGATAAAGGAATATGAAAAACTTGATAATGAGGAAGAC 

CGCCTTTGCAGAAGTCGACAAATTTATGATGCCTACATCATGAAGGAACTTCTTTCCTGT 

TCACATCCTTTCTCAAAGCAAGCTGTAGAACACGTACAAAGTCATTTATCCAAGAAACAA 

GTGACATCAACTCTTTTTCAGCCATACATAGAAGAAATTTGTGAAAGCCTTCGAGGTGAC 

ATTTTTCAAAAATTTATGGAAAGTGACAAGTTCACTAGATTTTGTCAGTGGAAAAACGTT 

GAATTAAATATCCATTTGACCATGAATGAGTTCAGTGTGCATAGGATTATTGGACGAGGA 

GGATTCGGGGAAGTTTATGGTTGCAGGAAAGCAGACACTGGAAAAATGTATGCAATGAAA 

TGCTTAGATAAGAAGAGGATCAAAATGAAACAAGGAGAAACATTAGCTTTAAATGAAAGA 

ATCATGTTGTCTCTTGTCAGCACAGGAGACTGTCCTTTCATTGTATGTATGACCTATGCC 

TTCCATACCCCAGATAAACTCTGCTTCATCCTGGATCTGATGAACGGGGGCGATTTGCAC 

TACCACCTTTCACAACACGGTGTGTTCTCTGAGAAGGAGATGCGGTTTTATGCCACTGAA 

ATCATTCTGGGTCTGGAACACGTGCACAATCGGTTTGTTGTCTACAGAGATTTGAAGCCA 

GCAAATATTCTCTTGGATGAACATGGACACGCAAGAATATCAGATCTTGGTCTTGCCTGC 

GATTTTTCCAAAAAGAAGCCTCATGCGAGTGTTGGCACCCATGGGTACATGGCTCCCGAG 

GTGCTGCAGAAGGGGACGGCCTATGACAGCAGTGCCGACTGGTTCTCCCTGGGCTGCATG 

CTTTTCAAACTTCTGAGAGGTCACAGCCCTTTCAGACAACATAAAACCAAAGACAAGCAT 

GAAATTGACCGAATGACACTCACCGTGAATGTGGAACTTCCAGACACCTTCTCTCCTGAA 

CTGAAGTCCCTTTTGGAGGGCTTGCTTCAGCGAGACGTTAGCAAGCGGCTGGGCTGTCAC 

GGAGGCGGCTCACAGGAAGTAAAAGAGCACAGCTTTTTCAAAGGTGTTGACTGGCAGCAT 

GTCTACTTACAAAAGTACCCACCACCCTTGATTCCTCCCCGGGGAGAAGTCAATGCTGCT 

GATGCCTTTGATATTGGCTCATTTGATGAAGAGGATACCAAAGGGATTAAGCTACTTGAT 

TGCGACCAAGAACTCTACAAGAACTTCCCTTTGGTCATCTCTGAACGCTGGCAGCAAGAA 

GTAACGGAAACAGTTTATGAAGCAGTAAATGCAGACACAGATAAAATCGAGGCCAGGAAG 

AGAGCTAAAAATAAGCAACTTGGCCACGAAGAAGATTACGCTCTGGGGAAGGACTGTATT 

ATGCACGGGTACATGCTGAAACTGGGAAACCCATTTCTGACTCAGTGGCAGCGTCGCTAT 

TTTTACCTCTTTCCAAATAGACTTGAATGGAGAGGAGAGGGAGAGTCCCGGCAAAATTTA 

CTGACAATGGAACAGATTCTCTCTGTGGAAGAAACTCAAATTAAAGACAAAAAATGCATT 

TTGTTCAGAATAAAAGGAGGGAAACAATTTGTCTTGCAATGTGAGAGTGATCCAGAGTTT 

GTGCAGTGGAAGAAAGAGTTGAACGAAACCTTCAAGGAGGCCCAGCGGCTATTGCGTCGT 

GCCCCGAAGTTCCTCAACAAACCTCGGTCAGGTACTGTGGAGCTCCCAAAGCCATCCCTC 

TGTCACAGGAACAGCAACGGCCTCTGA 
SEQ ID NO: 2 AA144574_M BARK2_M 

CTGCTTCGTAGTCTACAGAGACCTGAAGCCTGCGAACATCCTCCTAGATGAATATGGGCA 
CGTGAGGATATCGGATCTCGGCCTTGCCTGTGATTTCTCCAAAAAGAAGCCTCATGCCAG 
CGTGGGCACCCATGGGTACATGGCTCCCGAGGTGTTGCAGAAGGGAACGTGCTATGACAG 
CAGCGCCGACTGGTTCTCCCTGGGCTGTATGCTCTTCAAACTTCTGCGGGGCCACAGCCC 
' CTTCAGGCAGCATAAAACCAAAGACAAGCATGAGATAGACCGAATGACCCTGACCGTGAA 
CGTGCAGCTTCCAGATGCCTTCTCCCCTGAGCTGAGGTCCCTCTTAGAGGGTTTGCTCCA 
GCGGGACGTGAGCCAGCGGCTGGGCTGCGGAGGAGGAGGGGCACGAGAGTTGAAGGAGCA 
CATCTTCTTCAAGGGCATTGACTGGCAGCATGTGTACTTACGGAAGTACCCGCCACCCCT 
AATCCCTCCTCGGGGAGAGGTCAACGCTGCAGATGCCTTCGATATCGGCTCCTTCGATGA 
GGAAGACACCAAAGGCATTAAGCTGTTGGACTGTGACCAGGACCTCTATAAGAACTTCCC 
ACTGGTGATCTCCGAGCGCTGGCAGCAAGAAGTGGTGGAGACCATCTATGACGCCGTCAA 
TGCTGATACTGATAAAATCGAGGCCAGGAAGAAGGCTAAAAATAAGCAACTTGGTCAAGA 
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FIGURE 2B 



GGAAGATTACGCTATGGGGAAGGACTGCATCATGCACGGGTACATGCTGAAGCTGGGGAA 
CCCCTTTCTCACACAGTGGCAAAGACGCTATTTTTACCTGTTCCCCAACAGACTGGAGTG 
GAGAGGAGAGGGCGAGTCTCGGCAAAGTCTACTGACCATGGAACAGATCATGTCTGTGGA 
GGAGACCCAGATTAAAGACAGAAAGTGCATCTTACTCAGGATAAAGGGAGGGAAGCAATT 
TGTCTTGCAATGTGAGAGTGACCCCGAGTTTGCACAGTGGCTGAAGGAGCTGACCTGCAC 
CTTCAATGAGGCCCAGAGACTGCTGCGCCGTGCCCCCAAATTCCTCAACAAACCACGGGC 
CGCCATCCTGGAGTTCTCCAAGCCACCACTGTGTCACAGAAATAGCAGCGGCCTCTGAAC 
CACAGAGCAGCGGGGCCTGAAGGAGGGGCCCCAGCTCTTCAGCCCAGGAGTGGAACGAAG 
CCACGGGGAACCGTGTGGGGCTAAGACACAGTGTTTCTGAGCACTGACGGGGCTGCTCCA 
AGCCGAGGAGGCTCAGGACACCAGGGCGGCCTTCTGGGAGCTGGGACATCCTCGGGGCTG 
TCCTATCCACACTCGAAATTACTGAAGAAGCAGAGGCATTCTGCTGTG 

SEQ ID NO: 3_AA826850_H 

GAAGAGGATGGGCTCGTCCATGTCGGCGGCCACCGCGCGGAGGCCGGTGTTTGACGACAA 

GGAGGACGTGAACTTCGACCACTTCCAGATCCTTCGGGCCATTGGGAAGGGCAGCTTTGG 

CAAGGTGTGCATTGTGCAGAAGCGGGACACGGAGAAGATGTACGCCATGAAGTACATGAA 

CAAGCAGCAGTGCATCGAGCGCGACGAGGTCCGCAACGTCTTCCGGGAGCTGGAGATCCT 

GCAGGAGATCGAGCACGTCTTCCTGGTGAACCTCTGGTACTCCTTCCAGGACGAGGAGGA 

CATGTTCATGGTCGTGGACCTGCTACTGGGCGGGGACCTGCGCTACCACCTGCAGCAGAA 

CGTGCAGTTCTCCGAGGACACGGTGAGGCTGTACATCTGCGAGATGGCACTGGCTCTGGA 

CTACCTGCGCGGCCAGCACATCATCCACAGAGATGTCAAGCCTGACAACATTCTCCTGGA 

TGAGAGAGGACATGCACACCTGACCGACTTCAACATTGCCACCATCATCAAGGACGGGGA 

GCGGGCGACGGCATTAGCAGGCACCAAGCCGTACATGGCTCCGGAGATCTTCCAXTCTTT 

TGTCAACGGCGGGACCGGCTACTCCTTCGAGGTGGACTGGTGGTCGGTGGGGGTGATGGC 

CTATGAGCTGCTGCGAGGATGGAGGCCCTATGACATCCACTCCAGCAACGCCGTGGAGTC 

CCTGGTGCAGCTGTTCAGCACCGTGAGCGTCCAGTATGTCCCCACGTGGTCCAAGGAGAT 

GGTGGCCTTGCTGCGGAAGCTCCTCACTGTGAACCCCGAGCACCGGCTCTCCAGCCTCCA 

GGACGTGCAGGCAGCCCCGGCGCTGGCCGGCGTGCTGTGGGACCACCTGAGCGAGAAGAG 

GGTGGAGCCGGGCTTCGTGCCCAACAAAGGCCGTCTGCACTGCGACCCCACCTTTGAGCT 

GGAGGAGATGATCCTGGAGTCCAGGCCCCTGCACAAGAAGAAGAAGCGCCTGGCCAAGAA 

CAAGTCCCGGGACAACAGCAGGGACAGCTCCCAGTCCGAGAATGACTATCTTCAAGACTG 

CCTCGATGCCATCCAGCAAGACTTCGTGATTTTTAACAGAGAAAAGCTGAAGAGGAGCCA 

GGACCTCCCGAGGGAGCCTCTCCCCGCCCCTGAGTCCAGGGATGCTGCGGAGCCTGTGGA 

GGACGAGGCGGAACGCTCCGCCCTGCCCATGTGCGGCCCCATTTGCCCCTCGGCCGGGAG 

CGGCTAGGCCGGGATGCCCGTGGTCCTCACCCCTTGAGCTGCTTTGGAGACTCGGCTGCC 

AGAGGGAGGGCCATGGGCCGAGGCCTGGCATTCACGTTCCCACCCAGCCTGGCTGGCGGT 

GCCCACAGTGCCCCGGACACATTTCACACCTCAGGCTCGTGGTGGTGCAGGGGACAAGAG 

GCTGTGGGTGCAGGGGACACCTGTGGAGGGCATTTCCCGTGGGCCCCCGAGACCCGCCTA 

GATGGAGGAAGCGCTGCTGGGCGCCCTCTTACCGCTCACGGGGAGCTGGGGCCATGGATG 

GGACAGGAGTCTTTGTCCCTGCTCAGCCCGGAGGCTGTGCACGGCCCTCGTCACAAGGTG 

ACCCTTGCAGCACAGGCCGCGGGTGCCCCAGGCTCGGCTCAGTTCTTGGAGGTCAAGGGC 

ATGGGTTGGGGTAGTGGGTGGGGAGGTGAATGTTTTCTAGAGATTCAAACTGCTCCAGCA 

ATTTCTGTATAGTTTTCACCTCTGAGAATTACAATGTGAGAACCGCTC 

SEQ ID NO: 4 AA960957_H 

GTCCCACATCCCGCATCCGGCATCCCAGCGGCCGGGCATGTAGCAGCGGCAGCAACGGCG 
GAATATGGGCGGGAACCACTCCCACAAGCCCCCCGTGTTTGACGAGAATGAGGAAGTCAA 
CTTTGACCATTTTCAGATTCTGCGGGCCATTGGTAAAGGGAGTTTTGGAAAGGTATGCAT 
CGTGCAGAAGCGAGACACTAAGAAAATGTATGCAATGAAGTACATGAACAAGCAGAAGTG 
CATCGAGAGGGATGAGGTTCGGAATGTTTTCCGGGAGCTGCAGATCATGCAAGGGCTGGA 
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FIGURE 2C 

AGAGGGGACTGTGAAACTCTACATCTGT^ 

GAGGAAGCTCCTGACCAAGGATCCTGAGAGCCGTO 
CGTGCCCTACTTGGCCGACATGAACTGGGA^ 



TCTAGAATCCAAGCCACTTCACAAAAAGAA ^ CTGTTTGGRGRC TGTCCG 
TGGCACAAAGGACAGCTGCCCGCTG^TGGACACCTGCAGCACTGTTT 

GGAGGAATTCATCATATTCAACAGAGAGA^GCTC^ 



CAGGGAGAAGCTTGCCTTTGAACTGGAAGATGTTGGGATG^AG 
TTTTTATTTTTTCATCCAACT^ 

GCTGTCCATGTCATCTCTTAGAmCiiA^ TCAACCGAGACTATTTC 
ATTTTTATTTTTTAAAAAAGAAATAGTCAGT^ 

tggattgtgtgctcctcgtcagttgacttg™ 
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FIGURE 2D 

ATGTTCAAATGAGAGAATTTGAAGTGTTGAAAAAA^i^ rTACTTATTAT GGAATTTTGTC 

tSttgctattgaagaggagacaacaacaaga^ 

CATGTGGGAGTTTATACACTGTTTTAGAAGAACCTTC 

SgaIttcttaattgttttgcg^ 

GTATAGTGCACCGTGATATCA^GCCAGG^ 

agtctgtgtacaaactcacagattttggtg^ 

TTGTTTCTCTGTATGGCACAGAAGAATAT^GCACCLi^ GAGCATTGG GGTAA 

^GAAAAGATCATCAGAAGAAATA^ 

CATTTTACCATGCAGCTACTGC^T^ 

TTAAAGATGATTACAATGAAACTGTTCACAAAAA^C TGAT GAAGATCAACC 
TCTGTATCAGAAACATTGAAAAAACTGTGAAAGTA^ 

AGGCATTTTTGAATAAGT^ 
TATTATCGCTGACTAATCAGTGTTTTGAT^ 

ATACTAATGAGTTACAAGAAACTCTGCCTCAGA^ 
AACATACCATGACCCCAATTTATCCA^GTTCTAAC^ 
TGAAGAAATTAAAGGAAGAGATGGAAGGGGTGGTT^ 

TTTTAGAAAGGTTTGG^ 



TGAAGAAATTAAAGGAAGAGATGGAAGG^i CCTTCGC AACGTTGACTGTCTTT 
TTTTAGAAAGGTTTGGCTCTT^ 

AGCTTTCTAATAGAAGTTTAAGAAAAGlTTCCGll^ CAGT ATTTGATGTGGTCG 
^TGAATGCCTTTATA^ 

TGTAAATATGTACAATATTGTA^ 
GATGTAATTTTATCTTTTAACATTTATAATTAT^ 

GAGAAGAAAGCCATGACCGACCAATATGTTGACATAC GTGGAAC CAAAATAGG 
TAAATAAGTTATTTTCTCTGACCGCCT^^ 

CA^CTTACAAATCAGGAAGACTGA^^ 
TGTGAGTGGGGAGCAGAACCGCACCACTGTTAT^ 

ATAAAGTGGCATTATTTTATTTTAC^^ 
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FIGURE 2E 



GTTTTAAGCTGTATATTTCTTTAATTCTGCTTACTATTTCATGGAAAAAAATAAATTTCT 
CAATTTTAAAAAA 

SEQ ID NO: 6_AA305176_H 

TGGCTGCTCGCGGAGGGGCAGTGTACGCGGGGCCGCTGTAGGCTGTCCAGCGATGGATCC 
CACCGCGGGAAGCAAGAAGGAGCCTGGAGGAGGCGCGGCGACTGAGGAGGGCGTGAATAG 
GATCGCAGTGCCAAAACCGCCCTCCATTGAGGAATTCAGCATAGTGAAGCCCATTAGCCG 
GGGCGCCTTCGGGAAAGTGTATCTGGGGCAGAAAGGCGGCAAATTGTATGCAGTAAAGGT 
TGTTAAAAAAGCAGACATGATCAACAAAAATATGACTCATCAGGTCCAAGCTGAGAGAGA 
TGCACTGGCACTAAGCAAAAGCCCATTCATTGTCCATTTGTATTATTCACTGCAGTCTGC 
AAACAATGTCTACTTGGTAATGGAATATCTTATTGGGGGAGATGTCAAGTCTCTCCTACA 
TATATATGGTTATTTTGATGAAGAGATGGCTGTGAAATATATTTCTGAAGTAGCACTGGC 
TCTAGACTACCTTCACAGACATGGAATCATCCACAGGGACTTGAAACCGGACAATATGCT 
TATTTCTAATGAGGGTCATATTAAACTGACGGATTTTGGCCTTTCAAAAGTTACTTTGAA 
TAGAGATATTAATATGATGGATATCCTTACAACACCATCAATGGCAAAACCTAGACAAGA 
TTATTCAAGAACCCCAGGACAAGTGTTATCGCTTATCAGCTCGTTGGGATTTAACACACC 
AATTGCAGAAAAAAATCAAGACCCTGCAAACATCCTTTCAGCCTGTCTGTCTGAAACATC 
ACAGCTTTCTCAAGGACTCGTATGCCCTATGTCTGTAGATCAAAAGGACACTACGCCTTA 
TTCTAGCAAATTACTAAAATCATGTCTTGAAACAGTTGCCTCCAACCCAGGAATGCCTGT 
GAAGTGTCTAACTTCTAATTTACTCCAGTCTAGGAAAAGGCTGGCCACATCCAGTGCCAG 
TAGTCAATCCCACACCTTCATATCCAGTGTGGAATCAGAATGCCACAGCAGTCCCAAATG 
GGAAAAAGATTGCCAGGTTTGAGGGACATTTATCTTAATGAAAATCT^ATTATGTATGTCA 
AATGAATGTGAGAAATATTATACCTTTTCATATAAATTCCATAAAGAAATGAAATTGTTA 
CATGAATGGCAGTCATAGTATTAATCAGAAATTCATTTTCCTGCACATTCTGTCAAATTC 
TTTTGAAATATTTCATTTCTCATTCAATTGTGACATTGTTCTTACTTGATTATATAATGA 
GATTCTTGCAGTAAATTGATAATAAATGCTTGGCTTCTGTGTATCTAGGTGGACCTCACT 
TGTTTTTAGAAGTCCTTCCCATGATACAGACATTGGCTTGTTGGTTTTGTTTTATTTTGT 
TTTTAACATATGTCATTTAAAAACTCATATTACCTCCTTTT 

SEQ ID NO: 7_AA116841_M 

CCACGCGTCCGATCCCATGGCCAGAAGGCGAAGAAAAGCTATCTGATAATGCTCAAAGTG 
CAATGGACATGCTTTTAACCATTGATGATTCAAAGAGAGCTGGAATGAGAGAACTAAAAC 
AGCATCCTCTCTTCAGTGAAGTGGACTGGGAAAATCTGCAGCATCAGACTATGCCTTTCG 
TACCCCAACCAGACGACGAAACAGATACATCCTATTTTGAAGCCAGAAATAATGCTCAAC 
ATCTGACCGTATCTGGGTTTAGTCTGTAGCACATGCGTGTCATTTTTATCTAACTTGTGA 
TATAGAATTAAGTTTTACAGTAATATGCTACTTAATACTAGATTGGTCTAAATGGGATAA 
AAGTCATTATTTTACCCAGACTGAACAGCTTTTAATTACTAAGTACAACAGTTTTTACAG 
AATTAAAATACTATAAGCAATATAATCAGTAATTAATCTTTACCTTAGAACTGTATATAA 
GCCATAATAGCTTTTTTCATCTTATTTATTCACTGCACTTTATGAAGAGCAAAGTATCAA 
TAAACTAAAACACTACCACTCTAAATAGAGGGAGTGAGCCGT 

SEQ ID NO: 8_AA256100_H 

AGGGAGCTGACGGGCGCCCGGCCGGCTGCGGTCCGTGCGGAGGCTGAGCCGGCCGCGGGC 
GCGACCGGAGGCAGTTTCCGTTACTATGGCAATGACGGCAGGGACTACAACAACCTTTCC 
TATGAGCAACCATACCCGGGAAAGAGTGACTGTAGCCAAGCTCACATTGGAGAATTTTTA 
TAGCAACCTAATTTTACAGCATGAAGAGAGAGAAACCAGGCAGAAGAAATTAGAAGTGGC 
CATGGAAGAAGAAGGATTAGCAGATGAAGAGAAAAAGTTACGTCGATCACAACACGCTCG 
CAAAGAAACAGAGTTCTTACGGCTCT^AAAGGACCAGACTTGGCTTGGATGACTTTGAGTC 
TCTGAAAGTTATAGGAAGAGGAGCTTTTGGAGAGGTGCGGTTGGTCCAGAAGAAAGATAC 
AGGCCATATCTATGCAATGAAGATATTGAGAAAGTCTGATATGCTTGAAAAAGAGCAGGT 
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FIGURE 2F 

GGCCCATATCCGAGCAGAAAGAGATATTTTGGTAGAAGCAGATGGTGCCTGGGTGGTGAA 

GATGTTTTACAGTTTTCAGGATAAGAGGAATCTTTATCTAATCATGGAATTTCTCCCTGG 

AGGTGACATGATGACATTGCTAATGAAGAAAGACACCTTGACAGAAGAGGAAACACAGTT 

CTACATTTCAGAGACTGTTCTGGCAATAGATGCGATCCACCAGTTGGGTTTCATCCATCG 

GGATATTAAGCCAGACAACCTTTTATTGGATGCCAAGGGTCATGTAAAATTATCTGATTT 

TGGTTTATGTACGGGATTAAAGAAAGCTCACAGGACTGAATTTTATAGAAATCTCACACA 

CAACCCACCAAGTGACTTCTCATTTCAGAACATGAACTCAAAGAGGAAAGCAGAAACTTG 

GAAGAAGAACAGGAGACAACTGGCATATTCCACAGTTGGGACACCAGATTACATTGCTCC 

AGAAGTATTCATGCAGACTGGTTACAACAAATTGTGTGACTGGTGGTCTTTGGGAGTGAT 

TATGTATGAAATGCTAATAGGATATCCACCTTTCTGCTCTGAAACACCTCAAGAAACATA 

CAGAAAAGTGATGAACTGGAAAGAAACTCTGGTATTTCCTCCAGAGGTACCTATATCTGA 

GAAAGCCAAGGACTTAATTCTCAGATTTTGTATTGATTCTGAAAACAGAATTGGAAATAG 

TGGAGTAGAAGAAATAAAAGGTCATCCCTTTTTTGAAGGTGTCGACTGGGAGCACATAAG 

GGAAAGGCCAGCAGCAATCCCTATAGAAATCAAAAGCATTGATGATACTTCAAATTTTGA 

TGACTTCCCTGAATCTGATATTTTACAACCAGTGCCAAATACCACAGAACCGGACTACAA 

ATCCAAAGACTGGGTTTTTCTCAATTATACCTATAAAAGGTTTGAAGGGTTGACTCAACG 

TGGCTCTATCCCCACCTACATGAAAGCTGGGAAGTTATGAATGAAGATAACATTCACCCA 

TAACCAAGAGAACTCAGGTAGCTGCATCACCAGGCTTGCTTGGCGTAGATAACAATACAC 

TGAAATACTCCTGAAGATGGTGGTGCTTATTGACTACAAGAGGAAATTCTACAGGATTAG 

GATTTCTAAGACTACTATAGGAATTGGTTGGCAGTGCCAGCTGGCTCTTTTTTTTAATAT 

TTTATTATTTTTGTTAACTTTATTATATGAAGGTACTGGAATAAAAGGAACAGACATCCC 

TTTCTAACTGCACTGCCTACATGCGTATTAAGGTCCATTCTGCCTGTGTGTGCTGTGGCT 

TTGAACTGTAACACCTCTAATCAATTCAGGAGAAACACATATCATTTAAAGCAACATAGG 

CTAACCTGTAGGTAACACTGCAGTATTGATGTTTTACTGCAAATCTTATGGGTCTAGATA 

ATCAGTAAAAGCCATCTTCCATAGTTGGTGTTAGAACATTGCCCTATTGGTTTGGACATC 

TGTAGAATATATATGAAGACAATTTCTGTAATGGTTTTAAGAGATTTAAAAAGAAATTCA 

CTGGTTCTTTACAAAATAGAATTTATCATCAAGTTATTACACAAACTTCACAGTAAGGAG 

TGACAAGTTTATAATAAGGAAGACAAAGTTTAACACCTTCACTCAAGCACTCCACTAATA 

TATTTACGTTGCATTCAGAAATACTGATGACCTTCATATACGTAGTCTGTATACTCATAG 

GGAGATGTACTGTATTATATAACATGTAAAGTTGATTTTCTTGTGACAAGAGAACTTCTT 

TTTTTAACAAGAGGACATGGCATTATTTTAATTTGATTATGGTGAGTTGAATTTAAGACA 

TGACCATGAAGGCTGCTTGTAGAATTAGTGTATTTTTATTAAACTATTTTTTTAAATGTC 

AAACTTCTATCATGTAAATGGACTTATAGAGAACAAAAAGCTATTTACTTTGGTTTTCTA 

GAAAGTTGTTACATATCATGGCTGGTTAACTTTTATTTCTTTTGATGAAAATTTTTCCTT 

TGATAGTACTTGTATTATTGTGCCATTATTTTCTTATGCTCCAAATGTACCAAAGATCTT 

GAACAGAGTGGATGTTCACAACTGAGTAGAATTTTCCTTTCCTGTGGGCATGCTGTATTC 

AGACCTGACAGATCTTTGATAGAGGTCAGCTTATTAAAGGGCAATATTGTTCTTGTTTAG 

CTACATCACTGTGGTGAATATAGATGGAATTAAGGAAGTAAATGCAGGCCAGGGGGTTGT 

GATGAGAGGATAGGGGAGATAATATCAGCATCAAATTCTTTGGGTATCTCTCTAAGAATT 

AAATAATCTTTTCTAGCTTAATATTTTAATTCTAATTCAAACAACTCTGAGGTTTTGGTT 

TCATTAGTAATAGTTGAGGAATAATATACTAGCAAAGAATGGCCTAATGTTTGTCATAAC 

TGTTAATGGATGAAATTTTTTAAAGATACAACCATGATAACCATTATAAATGATCTATGA 

TCAAAATCTAAAGTGATGAATTATTTGTAGGAATGTCTTCCTAATGGGGAAGAATTGCAT 

AGGAGCATTATGCAAATCTACACAAGCTTTTATAAATGTTGCTGCTGGGTAGCTCCACAG 

TGTTTCATAAGGCCATCCTGTTTCCCCCAACTCCCCCATTTTTGGTTTGTTTCTTTTTAA 

ATATTTGTTGAGTACTTACGTGTTTATCTAACAGTTCACTTCCATTTTTCTAGTCTGGAT 

TTTTTGAGTATTTAGGAAAGAGAGCTATTAAAAACTCTGGGGATTTCTCAATGTGACTAA 

CTCTAATTTTTCTAATTATAACTGCCTTTAATTAACATAATATTAACTTTTGCTGAGGTT 

TATGAGATTTTCTCACCCCACATCGCTCCCCTTTTTTTAAAAAGGACTGTTTTGCTAGTG 

TGATAATGAATAGGTAAGATATGAGATAATTGCAACATTGTCCTAGTTCTAGTATGGTAA 
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FIGURE 2G 

CTATTCTTGAAATGGTATTGAAAAATACCGTTAATTCAAATTGACAGAGATTGATAAAAA 

GAAACTGATTTACCTAAGTTTACTTTTTAATTGCATAATAGAGCATTTTTTGTTTTGAGT 

TCCCTCATTCTTATTACCAGAAAGAGCTTGCAAATAGTTTTACTTTCTTGGCACTGGAAG 

GGTAGTTCTGGAAAGCTACTTTGTTGAGAGTCTCATTCTTCCCTGGAGTTAATAGAGTGA 

TTCACAATCTTTGGGGTTTTCTCCTCATCAAAAGCATTTCTTAAGTGCCTATCTAAAAGC 

AATTAAAGACTGTGTCTGCCCTTTAGAAGCTAAGAATTTGATTCATGATGCAAATTAACT 

AGATAATTTGCAAAGTACCCTTGAGATTGAATTTTCTCTATTATATATTTCCCATATTTC 

AGGTGAATAATTTAATTTAAATGACAAAACCCTATCTAGTCAACTGGGCATAATGACATT 

TTCTTTAAATTAGACTCTATTTTGAATTAAAAGAGTTTTATTATAAACCGTGTGTTTTTG 

GTTTTTCTAAGTATATAGAAAGCTTGTATAATTCAGATTTATCAATTTCCTGATTTAATG 

TAGACTTTGACTTTTTTATTAAAAACCTTTGTATTAAAGCAAGTTATGTTATTTTTCTTT 

TATGCATTTATTACTAACATAGCTTTAAATCTTTAAATGTATTGAAGCATTGTGCTGTCT 

GAAAATAAGGAATTGCTTATAAACCAGCCACTTCTGAATACAATATGTAGCTGATTTAAT 

AAGCTAGTTAGTGAATGGAAAATAAGTGTGGAGTATTAAAAATGTTCTTTGGTTGGTAAG 

GCCTAAGATAGGGTTTCATTTATTTCTATACTTTTTCTGTTTTTTAAACACCTGCATATT 

TTTATGTAAATCTCTAAATTTAAAATATTTTAAGTACATTTATTTTTGGTGTTTTATTGT 

ATAAAACCTTAGACAATCAATCAGTCAGTCTTTACTGACAGGAGCAGCAGCTATCTGTCT 

TTTGCTGATCTACAAATAAATGAATTGAGAATTTAGTCCATAGAGGTCCCTGGCTACCAA 

ACACATTCTCCTTTGAATTGTTAAAATTCAGAACATTCAAAATAACTGTTTTGCTACAAC 

rrATGATTATTTTCCTGTTGTGTTTATTTAAATTTACTTTCTCTTTAGAAGTGCACTTAT 

TTCTGAAAAATCTTAATGAAACAAACGCTTAGAACAAATATAAATATGAGACACTTGGGA 

CTACTAGAGATATTTTAGATTTTTATGAAAAAAATGTGAGGGGATATTGCTGCTTTAAAA 

AGGAATAAAGTAATAAAAATATATCTCAGCTATTTTTTTAAAGCAATATAATTCAGCAAT 

TGTCTAGAAAAGTAATCATGAGGCTACTGAGTTTGGTGTTCAGTTACTGAGTTTCAAAAA 

TGTTTTGGTGGCATGAGGACAAAATTTCATTGAAGGTAAGATAAGAATAAAAACTATGTT 

TAC 

cun TD NO- 9 AA210825 H 

CACGAGGGCTACTCGCGCCTGGCGACCCTCCCTGCCCCCCACCCAACCCCGCTCCGGCAA 
ScCCCCTTCCTCACGGCTCCCGACCGAACTTTTCTCCAACTTCTGCGACTCGTGAGATT 
CCCTTCTACCCACTCCGGCCCTCGGGACCCCTCTGCCCATCCCCTGGCCGGTCGGGTCCC 
TGCGAACCCCTTTATCTCTGGAATCCACTCGGTCCCCGACTCAGAGACTCCTGCCCTCCA 

ccc?Saggaccccgccatcctcaggtcccctccgcctgccagatcttttctcggatcc^ 

CGCTCTCCCACCACCTGCTCACGAGATCCCGCGGATCTAGAACCCAGGGTCCCCCGGGGC 
CCCCCGGCGGGTCCCGGGTGGGCTCCAGGCGGGCGGTCCCCGGCCTCCCCCCATGGCCAC 
CCCCCCCTCATTATCCCGCCGGGCTCCCTGGCTCTCCCGGGCCGGGGTCTCCTCCGCCCC 
CCGGCGGCCTAGAGCTGCAGTCGCCGCCACCGCTACTGCCCCAGATCCCGGCCCCGGGTT 
CCGGGGTCTCCTTTCACATCCAGATCGGGCTGACCCGCGAGTTCGTGCTGTTGCCCGCCG 
CCTCCGAGCTGGCTCATGTGAAGCAGCTGGCCTGTTCCATCGTGGACCAGAAGTTCCCTG 
AGTGTGGCTTCTACGGCCTTTACGACAAGATCCTGCTTTTCAAACATGACCCCACGTCGG 
CCAACCTCCTGCAGCTGGTGCGCTCGTCCGGAGACATCCAGGAGGGCGACCTGGTGGAGG 
TGGTGCTGTCGGCCTCGGCCACCTTCGAGGACTTCCAGATCCGCCCGCACGCCCTCACGG 
TGCACTCCTATCGGGCGCCTGCCTTCTGTGATCACTGCGGGGAGATGCTCTTCGGCCTAG 
TGCGCCAGGGCCTCAAGTGCGATGGCTGCGGGCTGAACTACCACAAGCGCTGTGCCTTCA 
GCATCCCCAACAACTGTAGTGGGGCCCGCAAACGGCGCCTGTCATCCACGTCTCTGGCCA 
GTGGCCACTCGGTGCGCCTCGGCACCTCCGAGTCCCTGCCCTGCACGGCTGAAGAGCTGA 

gccgSgcaccaccgaactcctgcctcgccgtcccccgtcatcctcttcctcctcttctg 

CCTCATCGTATACGGGCCGCCCCATTGAGCTGGACAAGATGCTGCTCTCCAAGGTCAAGG 
TGCCGCACACCTTCCTCATCCACAGCTATACACGGCCCACCGTTTGCCAGGCTTGCAAGA 

Sctcctcaagggcctcttccggcagggcctgcaatgcaaagactgcaagtttaactgtc 
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FIGURE 2H 

TGTCAGAAATCCTCACGGTGGAGTCCGCCCAGA^OT^^ 

ACCCACACTGCTTTGAGATCGTCACTGC^ 

GCGGGACTCCGGGTGGGCCA^GTGGGCA^ 

CAGCCATCCGCCAGGCCCTGATGCCCGTCATCCT^ 

ACGCGCCCCACAGACAAGCTTCTCTGAGCATCTCTOT 

ATGTGGACATTGCCACTGTCTACCAGAT^ 

TCAGCCACCCCTGGTTACAGGAGTACCAGAC^ 
CCATGGCCTTGATCTCAGCAAAA 

GAATTTGCAGGAAAA 

SEQ ID NO: i 1 -^ 31 ^"" r ArccCA GAAGTCTGTATTACCCACAGCTATTCCT 
AT GTCTGCAAATAATTCCCCTCCATC^ 

GCTGTGCTTCCAGCTGC^ 

TCTAATGGAAGCTTCAGTGCACCATCACTCACCAALlL. C CAGGAACTG 

tcatttctactgcaaattggcctcacacgggaga^ 

TCTTTATCTGCTGTCAAGGATCTT^ 
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FIGURE 21 

GAAAAGATGGTAATGTGCAGAGTGAAAGTTCCA^ 

CGTCCCACGATATGTCAGTACTGCAAGCGGTTACTG^ 

CAGTGTAAAGATTGCAAATTCAACTGCCATAAACGCTGTGCATC 

TGCCTTGGAGAGGTTACTTTCAATGGAGAACCTTCCAGTCTG^ 

CCAATGGATATTGACAATAATGACATAAATAGTGATAGTA^ 

GAAGAGCCATCACCCCCAGAAGATAAC^TGTTCT^ 

GAAAGAGATGAAGAAGCCGTTAAAACAATCAGTCCA^ 

ATGAGGGTTGTACAATCCATCAAGCACACA^^ 
GGGTGGATGGTCCATTACACCAGCAGGGATAACCTGAGAAA^^ 

GACAGCAAATGTCTAACATTATTTCAGAAT^ 

CCACTTTCAGAAATTCTCCGCATATCTTCACCACGAGA^ 

AGCAATCCACACTGTTTTGAAATC^ATTACTGATACTATG^ 

AATGGGGACAGCTCTCATAATCCTGTTCT^ 

CAGAGCTGGGAAAAAGCAATTCGCCAAGCCCTCATGC^ 

TGCACTTCTCCAGGGCAAGGGAAAGATCACAAAGAT^ 

ACACAGATACTTGTTGCTTTGAGGAATCTGCA^ 
AAGCCAGAAAATGTGCTGCTTGCATCAGCAGAGCCATTTC^ 

TTTGGATTTGCACGCATCATTGGTGAAAAGTCATTCA^ 
GCATACTTAGCCCCTGAAGTTCTCCGGAGC^ 
TCAGTGGGAGTTATCATCTA^GAGCCT^ 
GATATAAATGACCAAATCCAAAATGCTGCATTTATC 

AT TTCTGGTGAAGCA^TTGA = 

TACAGTGTTGACAAATCTCTTAGTCA I i ^ ^ TGAAAGTGAT GATGCT 
CCTAATC C AG ATGAT ATGGAAGAAGATC CTTAA 

Sccgccgcctggagcagctgca^ 
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FIGURE 2J 

GTGA^CTGCTTAGTAGC^^ 

GAGCAACTGCCTCCTGCCCACCCTTTGCGCAGCA^ 

GTGCCTGGATACCCCCAGCCTTCAGGGACACCTGTGAAGCCC^ 

CTGCAGGTCCGCCTCCTGGGCTGTGAACAGTTGCTGAC^AGCC^ 

CTGCTGCGCCGCTGCAGCTGCCCGAGGA^ 
TTTCTGTCGGTGCAAGGGCTTGAGTTCATTCAGAAGCTCCTCCAU 

cgcSggggggaggtgagcaggatgcggaggagat^ 

ACCMCTGGCAAGCCCTGCTCGCCCGCAC^^^ 

^gS 

TTTGTGTCAGAGCGATTCCTGGAACCCTGA 

SEQ ID NO: 13_AI021023_M P ^^ rGATTCCTGAAGAT CGCAGACTTTGG 
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WO 00/73469 

FIGURE 2K 

TCTrarTrPGCAAAGTGTGTCCCTTCCCCCTCCAGCTCGCCCTCTTCTACCTCCCAGCGA 

gacSggccS^ 
acttgctttatattaaatttgtaaaagtg 

~aa?atccgggS 
acS?ca^gtSgatctggaagaactctggg^ 

c^cS^ctctttctaccagagtttcccattaggcccattagggggcagcag^ 
^gSgaStoggcctcgtggctaaaggctcctttggaactgtcc 

^2cIgaaag?tgtatttgca^^ 
gaScSga^cagtgcaaagaggaggttagcatccagcga 

^TararArCTTGGGGGACAGCTGGCAGGGAAAACGGCACCTTTTCATTATGTGTAGCTAC 

cScgagSgSaa^tggagaatatt^ 

rarTTTGGTCTGTCCCGCCACGTGCCCCAGGGAGCTGAAGCCTACACTATCTGTGGCACT 

SSgScSggccccagaggtc 

^^^GGGTGTCTTGCTTTTCTCTCTGGCGACTGGAAAGTTTCCAGTGGCTGCAGAG 

Igaga?S?g?ggccSgttggc^^ 

i™^ArrGC?TCTCACTCCTGCTCCATGAGCTCTTATGCCAGAACCCCCTCCATCGT 

SXtaSgcItca^ 

rarCTCCTASGAAGCAGCCAGTGAACTTTGTCACGGAGACACAAGCTACCCAGCCCAGT 

?cagcgg!gacca^gcc^ 

CCTATCCCTGCTTGA 
ATGGAATTCTTTAGGATAGACAGTAA 

rr ArRAA^TTGTATAGTCTAAAATCAGAACCTTTGAAACCATTCTTTACTCTTCCAGAT 

ggaga^gct^tagLgttttaatactagtg^ 

gaca?ca?tagSgL 
tgatgatgtcagtggtactgatgaaggaagacctgatcttcttgtaaatttacctggt 

"AG 

actgaa^tgccaagcacatgaggagaa^ggcatagaggaactgagtgatccctctggg 



:aacaagagaagctgcagcaatgggacctactaagtttacacaaactaat 



rrT , „ G I'ATAACAGAGAAACACTATGCACAGGAGGATCCCAGGATGTTATTT 

S^cctgSgaSI^gtLttcag 

™Sccota?ttgtagtccactctcaggtgctaatgaata^^ 
SaaISSSaIgaagtattgctgtttacagatcagactgatgatttggctaa^ 

rr aACTTCTT^TTCCAGAGAGACTCTGAGACTAAGGGTGAAAGTGGTTTAGTGCTAGAA 

AGG??™CA^AGAGGG^ 

S^I^rrAPArATTCAGCTAACGTATTTTAGCAGGTGGAGTGAGGTTGAAGATTCC 

SagcgSS 

^cSSgcctgtgattggtggagtttgggtgctgtcctctttgaacttctcactc^ 
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FIGURE 2L 

GAACGACTTGGTGCTGGAGTTGCTGGTGTTGAAGATATCAAATCTCATCCATTTTTTACC 
CCTGTGGATTGGGCAGAACTGATGAGATGA 

SEQ ID NO: 16_AA626690_H 

ATGCTACCATTCGCTCCTCAGGACGAGCCCTGGGACCGAGAAATGGAAGTGTTCAGCGGC 

GGCGGCGCGAGCAGCGGCGAGGTAAATGGTCTTAAAATGGTTGATGAGCCAATGGAAGAG 

GGAGAAGCAGATTCTTGTCATGATGAAGGAGTTGTTAAAGAAATCCCTATTACTCATCAT 

GTTAAGGAAGGCTATGAGAAAGCAGATCCTGCACAGTTTGAGTTGCTCAAGGTTCTTGGT 

CAGGGGTCATTTGGAAAGGTTTTTCTTGTTAGAAAGAAGACCGGTCCTGATGCTGGGCAG 

CTCTATGCAATGAAGGTGTTAAAAAAAGCCTCTTTAAAAGTTCGAGACAGAGTTCGGACA 

AAGATGGAGAGGGATATACTGGTGGAAGTAAATCATCCATTTATTGTCAAATTGCACTAT 

GCCTTTCAGACTGAAGGGAAACTGTACTTAATACTGGATTTTCTCAGGGGAGGAGATGTT 

TTCACAAGATTATCCAAAGAGGTTCTGTTTACAGAGGAAGATGTGAAATTCTACCTCGCA 

GAACTGGCCCTTGCTTTGGATCATCTGCACCAATTAGGAATTGTTTATAGAGACCTGAAG 

CCAGAAAACATTTTGCTTGATGAAATAGGACATATCAAATTAACAGATTTTGGACTCAGC 

AAGGAGTCAGTAGATCAAGAAAAGAAGGCTTACTCATTTTGTGGTACAGTAGAGTATATG 

GCTCCTGAAGTAGTAAATAGGAGAGGCCATTCCCAGAGTGCTGATTGGTGGTCATATGGT 

GTTCTTATGTTTGAAATGCTTACTGGTACTCTGCCATTTCAAGGTAAAGACAGAAATGAG 

ACCATGAATATGATATTAAAAGCAAAACTTGGAATGCCTCAATTTCTTAGTGCTGAAGCA 

CAAAGTCTTCTAAGGATGTTATTCAAAAGGAATCCAGCAAATAGATTGGGATCAGAAGGA 

GTTGAAGAAATCAAAAGACATCTGTTTTTTGCAAATATTGACTGGGATAAATTATATAAA 

AGAGAAGTTCAACCTCCTTTCAAACCTGCTTCTGGAAAACCAGATGATACTTTTTGTTTT 

GATCCTGAATTTACTGCAAAAACACCTAAAGATTCTCCCGGTTTGCCAGCCAGTGCAAAT 

GCTCATCAGCTCTTCAAAGGATTCAGCTTTGTTGCAACTTCTATTGCAGAAGAATATAAA 

ATCACTCCTATCACAAGTGCAAATGTATTACCAATTGTTCAGATAAATGGAAATGCTGCA 

CAATTTGGTGAAGTATATGAATTGAAGGAGGATATTGGTGTTGGCTCCTACTCTGTTTGC 

AAGCGATGCATACATGCAACTACCAACATGGAATTTGCAGTGAAGATCATTGACAAAAGT 

AAGCGAGACCCTTCAGAAGAGATTGAAATATTGATGCGCTATGGACAACATCCCAACATT 

ATTACTTTGAAGGATGTCTTTGATGATGGTAGATATGTTTACCTTGTTACGGATTTAATG 

AAAGGAGGAGAGTTACTTGACCGTATTCTCAAACAAAAATGTTTCTCGGAACGGGAGGCT 

AGTGATATACTATATGTAATAAGTAAGACAGTTGACTATCTTCATTGTCAAGGAGTTGTT 

CATCGTGATCTTAAACCTAGTAATATTTTATACATGGATGAATCAGCCAGTGCAGATTCA 

ATCAGGATATGTGATTTTGGGTTTGCAAAACAACTTCGAGGAGAAAATGGACTTCTCTTA 

ACTCCATGCTACACTGCAAACTTTGTTGCACCTGAGGTTCTTATGCAACAGGGATATGAT 

GCTGCTTGTGATATCTGGAGTTTAGGAGTCCTTTTTTACACAATGTTGGCTGGCTACACT 

CCATTTGCTAATGGCCCCAATGATACTCCTGAAGAGATACTGCTGCGTATAGGCAATGGA 

AAATTCTCTTTGAGTGGTGGAAACTGGGACAATATTTCAGACGGAGCAAAGGATTTGCTT 

TCCCATATGCTTCATATGGACCCACATCAGCGGTATACTGCTGAACAAATATTAAAGCAC 

TCATGGATAACTCACAGAGACCAGTTGCCAAATGATCAGCCAAAGAGAAATGATGTGTCA 

CATGTTGTTAAGGGAGCAATGGTTGCAACATACTCTGCCCTGACTCACAAGACCTTTCAA 

CCAGTCCTAGAGCCTGTAGCTGCTTCAAGCTTAGCCCAGCGACGGAGCATGAAAAAGCGA 

ACATCAACTGGCCTGTAA 

SEQ ID NO: 17 AA215680_H 

ATGAGCCTGGTGGCCTGTGAGTGCCTGCCCAGCCCCGGCCTGGAGCCTGAGCCTTGCTCA 
CGAGCACGGTCCCAAGCTCACGTGTACCTGGAGCAGATTCGCAACAGGGTGGCTCTGGGA 
GTGCCTGACATGACAAAACGTGACTATCTGGTGGATGCGGCCACGCAGATCCGGCTGGCC 
CTGGAGCGCGATGTTAGTGAGGACTATGAGGCGGCCTTCAACCACTATCAGAATGGCGTG 
GACGTGCTGCTCCGTGGCATACACGTTGACCCCAACAAGGAGCGACGTGAGGCTGTGAAG 
CTGAAAATTACCAAATACCTGCGGCGGGCAGAGGAGATCTTCAACTGCCACCTGCAGCGG 
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FIGURE 2M 

CCGCTGAGCAGTGGAGCCAGCCCCAGCGCGGGTTTCAGCAGCCTGAGGCTCCGGCCCATT 

CGCACGCTGAGCTCTGCCGTGGAGCAGCTGAGGGGCTGCAGGGTGGTCGGGGTCATCGAG 

AAGGTGCAGCTGGTCCAGGACCCGGCAACCGGAGGGACCTTTGTGGTGAAGAGCCTACCC 

AGGTGCCACATGGTGAGCAGGGAGCGGCTGACCATCATCCCACACGGAGTCCCCTACATG 

ArPAAGCTGCTCAGGTACTTTGTGAGCGAGGACTCCATCTTCCTGCACCTGGAGCATGTG 

CAAGGAGGCACTCTCTGGTCCCACCTGCTCTCCCAGGCGCACTCCCGACATTCTGGGCTC 

AGCTCTGGCTCTACCCAGGAGAGGATGAAGGCTCAGCTCAACCCCCACCTCAACCTCCTG 

ACCCCAGCGAGGCTTCCCTCAGGCCATGCCCCTGGCCAGGACAGAATCGCCCTGGAGCCT 

rCTAGGACTTCTCCGAACCTTCTCCTAGCTGGGGAGGCCCCATCCACCAGACCCCAGAGG 

GAGGCTGAAGGTGAACCCACAGCCAGGACCAGCACCTCTGGCTCCTCGGACCTTCCAAAG 

GCCCCAGGTGGCCACCTGCACCTTCAAGCTAGGAGGGCTGGCCAGAACTCAGACGCTGGG 

CCCCCTCGGGGGCTCACTTGGGTTCCTGAGGGGGCCGGCCCGGTGCTAGGGGGCTGTGGC 

CGAGGCATGGATCAGAGCTGCCTGTCAGCAGATGGGGCCGGCCGGGGCTGTGGCAGGGCC 

ACCTGGAGTGTGAGAGAGGAGCAGGTGAAGCAGTGGGCGGCAGAGATGCTGGTAGCGCTG 

GAGGCGCTGCACGAGCAGGGGGTGCTGTGCCGGGACCTCCACCCCGGGAACCTGCTCCTG 

GACCAGGCAGGTCACATCCGGCTCACATATTTTGGCCAGTGGTCAGAGGTGGAGCCCCAG 

TGCTGCGGGGAGGCCGTGGACAATCTCTACAGCGCCCCAGAGGTGGGTGGGATTTCCGAG 

CTGACGGAAGCCTGTGACTGGTGGAGCTTTGGGTCTCTACTGTATGAACTGCTGACGGGA 

ATGGCACTGTCCCAGAGCCACCCTTCAGGAATCCAGGCCCACACCCAGCTCCAGCTGCCC 

GAGTGGCTCAGTCGCCCAGCGGCCTCTCTGCTGACTGAGCTGCTGCAGTTCGAGCCTACC 

CGGCGCCTGGGCATGGGAGAAGGTGGTGTCAGCAAACTCAAGTCCCATCCCTTTTTCAGT 

ACCATCCAATGGAGCAAGCTGGTGGGGTAA 
cpn ID NO: 18 SGK H 

ATGACGGTGAAAACTGA^GCTGCTAAGGGCACCCTCACTTACTCCAGGATGAGGGGCATG 

PTrCCAATTCTCATCGCTTTCATGAAGCAGAGGAGGATGGGTCTGAACGACTTTATTCAG 

AAGATTGCCAATAACTCCTATGCATGCAAACACCCTGAAGTTCAGTCCATCTTGAAGATC 

TCCCAACCTCAGGAGCCTGAGCTTATGAATGCCAACCCTTCTCCTCCACCAAGTCCTTCT 

CAGCAAATCAACCTTGGCCCGTCGTCCAATCCTCATGCTAAACCATCTGACTTTCACTTC 

TTGAAAGTGATCGGAAAGGGCAGTTTTGGAAAGGTTCTTCTAGCAAGACACAAGGCAGAA 

rAAGTGTTCTATGCAGTCAAAGTTTTACAGAAGAAAGCAATCCTGAAAAAGAAAGAGGAG 

AAGCATATTATGTCGGAGCGGAATGTTCTGTTGAAGAATGTGAAGCACCCTTTCCTGGTG 

rGCCTTCACTTCTCTTTCCAGACTGCTGACAAATTGTACTTTGTCCTAGACTACATTAAT 

GGTGGAGAGTTGTTCTACCATCTCCAGAGGGAACGCTGCTTCCTGGAACCACGGGCTCGT 

TTCTATGCTGCTGAAATAGCCAGTGCCTTGGGCTACCTGCATTCACTGAACATCGTTTAT 

AGAGACTTAAAACCAGAGAATATTTTGCTAGATTCACAGGGACACATTGTCCTTACTGAT 

TTCGGACTCTGCAAGGAGAACATTGAACACAACAGCACAACATCCACCTTCTGTGGCACG 

CCGGAGTATCTCGCACCTGAGGTGCTTCATAAGCAGCCTTATGACAGGACTGTGGACTGG 

TGGTGCCTGGGAGCTGTCTTGTATGAGATGCTGTATGGCCTGCCGCCTTTTTATAGCCGA 

AACACAGCTGAAATGTACGACAACATTCTGAACAAGCCTCTCCAGCTGAAACCAAATATT 

ACAAATTCCGCAAGACACCTCCTGGAGGGCCTCCTGCAGAAGGACAGGACAAAGCGGCTC 

GGGGCCAAGGATGACTTCATGGAGATTAAGAGTCATGTCTTCTTCTCCTTAATTAACTGG 

ga?gaotcattaataagaagattactccccctt™ 

CAGCTACGGCACTTTGACCCCGAGTTTACCGAAGAGCCTGTCCCCAACTCCATTGGCAAG 

tcccctgacagcgtcctcgtcacagccagcgtcaaggaagctgccgaggctttcctaggc 

TTTTCCTATGCGCCTCCCACGGACTCTTTCCTCTGA 
cpn in NO • 19 AA107515 M 

rGGGTCGACCCACGCGTCCGCCGGTTTCACTGCTCCCCTCAGTCTCTTTTGGGCTCTTTC 

SgStcgggacgatgaccgtcaaagccgaggctgctcgaagcacccttacctactcca 
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GAATGAGGGGAATGGTAGCGATTCTCMCGCTTT^ 

ACGATTTTATTCAGAAGATTGCCAGCAACAC^^ 
CCATTTTGAAAATGTCCCATCCTCAGGAGCCGGAGCTTATC 

CGCCAAGTCCCTCTCAACAAATCAACCTGGGTC^ 
CCGACTTTCACTTCTTGAAAGTGATCGGAAAGGGC^AGTTTTG^ 

GGCACAAGGCAGAAGAAGTATTCTATGCAGTCAAM 

AGAAGAAAGAGGAGAAGCATATTATGT^ 

CGGTGGACTGGTGGTGTCTTGGGGCTGTCCTGTATG^^ 
AACCAAATATTACAAACTCGGCAAGGCACCTCCTGG^ 

CCAGCTGACAGAACATTTTAAAAGAATTTGCACACCTGGAAG^ 
CCGGCGTGGCGCGACGCAGCGCGCGCTGCTTGATGGGAG^ 

CTCTCAATCBAGCTTGTC^GGTCTTCTTTTCTTCTCTl^i TTA GAAGG AAG 

Lgcgagcgagcgtgagagtgccgcctga^ 

ATGCAGGTCTAAGAGGAATCCCCGCAGGTCTGTCT 
ATGTGCCTTTTCTGAGATCGTGTTAG^ 

2aattttggtttgcaataaaatcttgaaaact 

SEQ ID NO: 20_AA109508_M __- r , / , r , r . r p/-'ppr'aGGTTCTACGCTGCTGAGGT 

ccacctgcagcgggagcgccggttcctggagc^ 

GGCCAGCGCCATTGGCTACCTGCACTCC^ 
GAACATTCTCTTGGACTGCCAGG^^ 
AGGTGTAGAGCCTGAAGACACCACATC^ 
TGAAGTGCTTCGGAAAGAGCCTTATGATCGAGCA^ 
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FIGURE 20 

^GGTTTATTTTTTCCAGCACAT^GAAW^TAATGT TGACAAATGG CTT 

SSatcaatatctcttattgttct 

GGAGATATGCAGAGTTTGATAAACTTTATAACACT ATC CAGATTTTATTAAAC 
^TCCAGATGTCAGAGCATTCCTTCAAATGGAC^ 

CTGAACGTAATGTGCTCTTGAAAAATG^TGAAACATC 

rrTTCCAAACAACTGAAAAGCTTTATTTTGTTCTGGAlli TCTGA CACCACTA 

TGATTAAAATTTATATTCTTGTTTAATAAGCTTAT 
AAATATTTGGTTCATGGTATGATCGA^^ 
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FIGURE 2P 

CCTGGGCAACAAGAGTGAAACTCCATCTCC 
ATGGCGCACCA^CGGGCATC 

Sggacsccca^^ 

TTCGAATGGCTCTTCCTCGCCTGGTCGCCTGAT^ 
TACGCGGCCACGCGGGCCACAGTGAAAAAGGAGTTTC 

CTCTTCGGGACTGTGAAGGATGACCTCTCTTTl^i CTCCAGC AGATCCGCATTAAC 
TGTGCGGCACCTGCCCCGCTGACCTCGGCTGAGAG^AGAGCT^ 

GAGGTGAAGACAGAGATCAGTGT^^ 

CCCCTGCAGCCTGAGGCCCAGCGGGCACTCC^^ 

ATCCAGATGAAGCTGGACCTAGAGCGGGAAACCA^TGAG^ 

GATGTGGCCCAGCTGCCCTCCCGGGTGCCCCGAGATG^ 

TACAAGCACACCCATGAGGGCGACCCCCTTGAOT 

GGGTACAAGTGCAGCATCAAGGAG^ 
GACTCCGTGGAGCAGGACTTCCATCTGGAGATCGCCAA^ 

GGCCCGGGTGAAAATGGGGATGACAGCTAG 

TGGCTGCGCTGGGAGGCGGCGGTGAGAGG^ 
CGGGAGGGAGAGCCGAGCAGCCCCGGCTCTGG^ 

CCACCCGGCGAAGTGCACACACCCAGA^^ 

ACAATATATGTGCTCTGCTCTCC^^ 
CACCAGGTCTGAATTCCAGACTCCTCCCCACCA^ 

GACCACAGACCCATTCAGGGAGGCTGGC^ 

TGTCAAAGCTGAGAATGAAGCGGAGAG^ 

GGGCCAAGGCTCTAGC^GTGGGAmCTGGA^^^ C AGTGTTTGGCGAGGA 
KftCCTCTGAAATCGTT^G^CR^TCCMCTOGATGGT CCGCTGRCC 

ACATGTGGGCCCTGGGCGTGGTGCTOT^ 
ACAGCATCCCGCAGGAGCTCTTCCGCAAGAT^ 
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FIGURE 2Q 

AGTCCCTGTCATCTCTGAGTGGGCCTTTGCAAGTGGTTCCTGACATTGATGACCAAATGA 
GCAATGCGGATAGCTCCCAGGAGGCGAAGGTGACGGAGGAGTGCTCCCAGTACGAGTTTG 

agaIctacatgcgtcagcagctgctgctggccgaggagaagagctccatccatgacaccc 

GGAGCTGGGTACCCAAGCGGCAGTTCGGCAGCGCACCACCGGTGCGACGGCTGGGCCACG 
ACGCACAGCCCATGACCTCCTTGGACACGGCCATCCTGGCGCAGCGCTACCTGCGGAAA 

TCGCTGTCAGGGCTGGGCCCTGTAGTGCTGGACTCTCCCGGGCCACAATAGGGACAG^ 

ArrGACAGGGACAGCCCAGGTCACACGTGGGGTCAGCAGAGGTACCACGAAGCTACCTTT 

TGGGATGATTGCTCGATTGTTTGGTTTTTAAATCTGAGAAGCCTAGATAACTAATCT 

TTTAATCACGATGTTTTAATCTACCTCTGTCTCTTTAACCATGCTGTCTCTGGACTGAGG^ 

AAGA^GGAdGAGGGAGCCTGCTCACCCCACTCCAGGGCCTTCCCCAGCGGCCACCAACTC 

ACCTGGGGCGCTGCTCCCCACAGTCCAAATAAGCTGAAAGTGCAGCTCGCTGCAGGCCCC 

agag?gagcttcccctcctccctgctctcccaggcccctgccacagcctctttc 

TCTCT^CTGATCCAGGCCCCTCAGTCCAAGCTTTGGAAAACCTTCACCTCATCTTAAAC 
CAAACTCAAATATATTTATTTTTTTACCAT 

gSgcgatcgccagS^^ 

?cg?ggccggggtcgcggagaggggcccccagctcctccgggggcagcagca^^ 
cc^ggggaacgggctcatccccagtccggcgcacagtgcccactgcagcttctaccg^ 

ArrCGGACCCTGCAGGCCCTCAGCTCGGAGAAGAAGGCCAAGAAGGCGCGCTTCTACCGG 

^?ggggaccgcSotcLgggcct 

gg?ScS2aSatctacaccatcgacggcagccggaaggtcaccag 
??5gSS?gagagttacgtgtgtgc 

g^ctcctctgtgaaaagtgaagtaaaagaaagtaaagatttcatcaa^^ 
gtcactgtgattcgaagtggagtgaagcctagaaaagccgtgcggatccttctgaataaa 

SgaSgS^tcctttgaacaagtcttaacagatatcaccg^ 
^aSgIg?cgtcLgaggctctgcaccctggatggaaagcaggtgagagttacgtgt^ 

CATCTGCCAGACTTTTTTGGTGATGACGATGTTTTTATTGCATGTGGACCAGAAAAATTT 
rGTTATGCCCAAGATGACTTTGTCCTGGATCATAGTGAATGTCGTGTCCTGAAGTCATCT 

Ig?caga??tctgctcatggcagatcttcttcc^^^ 
cg5tc?a™gtcctgaaggtgtgaatggaaacagatgctctgaatcatca^ 

gagaaata^aaaattggaaaggtc^^ 

Saga^Igg^Sggaaaggagtttgccctaaagat^^ 

gga^ggaacacctgattgagaatgaagtgtcaatactgcgccgagtgaaacat^ 

a?^?a^Sggtcgaggag 

rTrAAAGGTGGAGATCTCTTTGATGCAATTACTTCGTCGACCAAGTACACTGAGAGAGAT 

ggca^gcca?gg?gS^ 

GTGCACAGAGACATCAAACCAGAGAATCTCTTGGTGTGTGAATATCCTGATGGAACCAAG 
TCTTTGA^CTGGGAGACTTTGGGCTTGCGACTGTGGTAGAAGGCCCTTTATACACAGTC 
TGTGGCACACCCACTTATGTGGCTCCARAAATCATTGCTGT^AACTGGCTATGGCCTGAAG 

g?ggacaStggSca^tggtgtgatcacatacatacttct CT ^ 
cgSgtgagaacaatctccaggaagatctcttcgaccagatcttggctggg^ 

TT^CGGCC^C^CTGGGATAACATCACGGACTCTGCCAAGGAATTAATCAGTCAAATG 

cSSgSaaatgttgaagctcggtgtaccgcgggacaaatcctgagtcacccctg^ 

Sga?gI^ctcccaggagaataacatgcaag™ 
cactttaataatgcgctccccaaacagaacagcactaccaccggggtctccgtcatcatg 

US/ 11} 
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FIGURE 2R 

GGTAGCCGGGAGCTGCCCTCACATGGAAGTCGGTGCLl^ ^tcsgcaCTGTGT 

Sggcttggcatgtggag^c^^™^ gtcmggr gctc 

^TGTCAGTGACAACGATG^^ 
GCTG 



iTCCAGGCCTGAGCCAATGCA^CCCTGGGCGGGAA 



GGAACAGGGGTGGAGTGGCCTTTCCCA^ 

GACCTGCCAAGGTGATGGGCGTCTGAGTTTCACAI CTCGTACT TGGGACAGGCCT 
?CCTGACAGCTAAGGATGGGCC^ 

CTCATCCTCTGGGAAGGTCCTCCTTGTTTCCTA GACG CAGGGTCTAGATAGAA 
TTCTCATGGTACATGGTTGTCTGAAAGCCTTACCTA aacATCCCCCACACTCC 

™^ 

SEQ ID NO: 25_W30246_M SG ^^-^ rrGGAAGTTTCA GAGGATTGAAGATTTCT 
CCTGAAGGTGTGAATGGAAACCGGTGCTCCGAG^ 

GATCTCTTTGATGCGATTACCTCTTCAACCAAG^ 
TGGCAGTTTTCCCTGTTTTCCACCACCCCACTCTT^ TCACTCTCACT GTTAGATTT 



TGCCTTTTCCAGGTATCCCC 
^EO ID NO: 26 AA383293_H ^^„ rr ~ rACCCATT CTTCCCAGGCTCCCAG 

gSgtgcaggccccactggctgtgcg^ 
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WO 00/73469 

FIGURE 2S 

GGCAATGAAGCCCACCTGAGGAGTGGAGT^GGACTGT^ 

GACTTTTTTGGTGATGACGATGTTTTT^ 
CAAGATGACTTTGTCCTGGATCAT^ 




GAGCCCAAGACC^GGCCAGAAGAG^CA^GCCAGAC ^ C CGGGTCATTGGG 
CCCATGGGCATCATTGCCGCCAATGTGGAAW 
GATGGGAACTTTGCTGTCGTGAAGGAG^^ 
ATGAAGATCATTGACAAGTCCAGACT^^ 
TTGATCATCCAGAGCCTCTCTCA^ 
GACATGGAAATCTACCTGATCCTGGAGTAC^ 
ATAGAAAGTGTGAAGTTCCCGGAGCCCGATGCTGCCCT 
GCCCTCGTCCACATGCACGACAAGAGCATTGTCCAC^ 
TTGGTTCAGCGAAATGAGGACAAATCTACTACCTTG^ 
AAGCATGTGGTGAGACCTATATTTACTGTGTGT^ 
ATTCTTTCTGAGAAAGGTTATGGA^ 

TATATCCTGCTGTGTGG^ 

CTCTTTAACATCATCCAGCTGGG^ 

TCTGATGCTGCTAAAGATCTGGTGAGCCGGTTG^ 

ACAGCTCATCAGGTTCTTCAGCACCC^ 

CAATGTTAAATGTCACAACATATTT^mGWTMT^ GTTTTGCTTCCC 
CTGTGAGATTAATAAGGTGCATTG 

SEQ ID NO: 28_AA197883 M CGCCAGCGACC CCCGCCCCGCCGGCA 

ATGCCAACCGCGCCGGTCCTGCGCC^ 

CCCAGTCGCCCTGCGCCTCCCATTCCGGGCCACCG^ 

TGCTTAAGCTCGAAGATCTCTGAGAG^AAAGCTGCCAG^ 

GGACCTCTGGAGAAGCCAGTTCTGGGGCC^ 

CAGAGCAGCCTCCACTCAGTCCGCGCAGAGCACAG^ 
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FIGURE 2T 

AGGCATCAGGGCAAGACTTCCACAGTGCTGGCCC^ 
TGGGTAAGAGGGAAACAGGAGTCAGAACCTGG^TGGCCCGCG^ 

CAGGAGGAGACTC^TGCAAGTGGAGAGA^CATCTGGG^ TTGG GCCTGCAG 

GGGGAGATTGTCAGATGTGAGAAGTGTAAGAGAGA^ 

AGGGAGCCGTGCCCGCTGGGAACCAGTGAGCTG^ 

GAGAAGCAAGCAGAGCACGAGAAGAAGCCAGGCGGCT^ 
AAGGAGTCTAAGAGGAAGCTAGAA^ 

AGGCCCAAGGGCATCATCTCAGCGGATGTGGAGAAGLAU aaqCAGGCTTAC 

GGGGATGGCAACTTTGCCACCGTG^ 

GCCATGAAGATGATTGACAAGTCCCAGCTC 

ATTTTAATCATCCAGAGTCTCTCT^ 
ACGGAGGCGGAGATCTACCTGATCATGGAGTATG^ 

ATCGTTGAAAATGTGAAGTTTCCAGAGCCC^ 
AAGGCCTTCGTCCACATGCACGACAAGAATAT^ 

GTTGCAGAGCAGATGCCATAA 

SEQ ID NO: 2 9_DRAK2 H _ CGAGAAG CCAGGTCACAACCGTCGGCCCTTG 

CTCCGCTGCTGTCGCCAGGAGTCACTTCACGAGAK C GCCTCGCCCGGCTG 
TCTGGAAAAGTAAAAGTGGATCCTGCCACGTTCG^ 
GAGCTAGAGAACTCGTCCTGTGGCGGCCCCCGGCG^ 
GGGGGCAGTCCCGGGAGAACCTGCGGCGGCCGG^ 

GCAGGTGGAGAAATTTTCAGCCTGTGT^ 

GATGTTATCAGACTCATTAAAC^^ 
ATTGTACACCTTGATTTAAAGCCACAGA^TATA^ 

GACATTAAAATAGTAGATTTTGGAATGTCTCGA^ 

GAAATCATGGGAACACCAGAATATTTAGC^ 

ACAGCAACAGATATGTGG^TA^^ 
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FIGURE 2U 

TRTGATTTGTAGCTTCATATM^CATGmATATTGTWRTQ ATGATT TCATATCC 
GTGAAAG 

SEQ ID NO: 30_W44160 M DRAK2 M CGTCGGTCAGCG TCTGTGCG 

CC*GACGCGGCTGCACTTTTC^ 

GTCGCCGCCGGGAGTCGCCTCACAGGGGCCTGGCTGALO^ 

GAGTGCGAGGTAAAAGTCTGCCTAGAGAA^ 
GGAGATTCGATTGCCGAAGTGTCTCAGGCTTGCTAACT^ 

AAACAGAGAATTTTAATAATTTCT^^ 

TACTTTTGAGCAGTATATACCCACTCGGGGACA 

GAAAAATTGGGAATGCAAGTGAGCTTCGGGAAA^ 

CAGAAATCCTCAACTATGATCCCATTACCACA^ 

tagcgtatatgttgttaactcatacatcaccat™^ 



ATCTGAATATTTCTCAAGTGA^TCT 

agctggccacagacttcatccagagcc^ 

CAGAATCCTGCCTATCCCACTCATGGCTGCAGuA^i 

ctgaggaaacttcaggctcctctcaaattcagg^atctga^ 

agacctccaagtcctcctgta^tgggagctgt^ 

ctgaagatggcagcttagtttctaaaa^ 

ATGAACTTGTTCCAGATTTGTTCTGTTAGCAm 

cggaaatttgaaatctctggtgtga^attgtg^ 

TAGTTGAAAGTATTTCCCAGTTACC^^ 

GCC*TGTGCTCGG^GAGGA»CAGTCAACCAGAGC T „ TCTACGTGCATCA TA 
TGCAAACCGAGTCAAAACTCGACATCATTTC^ mgagGCCTTTGTCTTCTAGA 
TATCAGATCTAATAAGATCTGGAAGATGGAraTGCAAA AGTTTCAGT CAGACATA 
ATGATTAGAGTAGAGGAGAATTGGATAGTA^^ 

TTCATAAAS^TOTTAAGTO^G^CTAGTG ^^^naK 
CAGGATAGAAGCCCATCATTAGAGCATAGGCACTTC^ 
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FIGURE 2V 

gSagataaaatacagcctttaaacaacttc 

qPO ID NO: 31 H0124 8_H, DRAK1 JJ cCAGGCGC CACCTCAGGCTCG 
RTGATCCCTTTGGAGAAGCCAGGCAGCGGCG^GCTCCTCC^C 

ggccgSgcLgccggggtctgagcgggc^ 




?gIgaaattatgggtacccctgaatatgtgg^ 

^A^CAACAGATATGTGGAGCATTGGAGTGTTAACAi^ CAC AGATGAATTTA 

ScStcttaggcaatgataaacaagaaa^ 

SSreCTKTATCAAGATCATAGRTJ^GACCCftGCTGGA ATCRTC AGGCTCTACCA 



:ttggccatggaggac 
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FIGURE 2W 

TGCCTATGATCACTATAGTGCAATCTACAGCCTGCTGTGTGATCGACATAAGAGACATAA 

AACCCTGCGTCTCGGAGCACTTCCTAGCATGCCCCGAGCCCTGGCCTTTCAAGCACCAGT 

CAATATCCAGGCGGAGCAGGCAGGTACTGCTATGAACATCAGCGTTCCCCAGGTGCAGCT 

GATCAACCCAGAGAACCAAATTGTGGAGCCGGATGGGACACTGAATTTGGACAGTGATGA 

GGGTGAAGAGCCTTCCCCTGAAGCATTGGTGCGCTATTTGTCAATGAGGAGGCACACAGT 

GGGTGTGGCTGACCCACGCACGGAAGTTATGGAAGATCTGCAGAAGCTCCTACCTGGCTT 

TCCTGGAGTCAACCCCCAGGCTCCATTCCTGCAGGTGGCCCCTAATGTGAACTTCATGCA 

CAACCTGTTGCCTATGCAAAACTTGCAACCAACCGGGCAACTTGAGTACAAGGAGCAGTC 

TCTCCTACAGCCGCCCACGCTACAGCTGTTGAATGGAATGGGCCCCCTTGGCCGGAGGGC 

ATCAGATGGAGGAGCCAACATCCAACTGCATGCCCAGCAGCTGCTGAAGCGCCCACGGGG 

ACCCTCTCCGCTTGTCACCATGACACCAGCAGTGCCAGCAGTTACCCCTGTGGACGAGGA 

GAGCTCAGACGGGGAGCCAGACCAGGAAGCTGTGCAGAGGTACTTGGCAAATAGGTCCAA 

AAGACATACACTGGCCATGACCAACCCTACAGCTGAGATCCCACCGGACCTACAACGGCA 

GCTAGGACAGCAGCCTTTCCGTTCCCGGGTCTGGCCTCCTCACCTGGTACCTGATCAGCA 

TCGCTCTACCTACAAGGACTCCAACACTCTGCACCTCCCTACGGAGCGTTTCTCCCCTGT 

GCGCCGGTTCTCAGATGGGGCTGCGAGCATCCAGGCCTTCAAAGCTCACCTGGAAAAAAT 

GGGCAACAACAGCAGCATCAAACAGCTGCAGCAGGAGTGTGAGCAGCTGCAGAAGATGTA 

CGGGGGGCAGATTGATGAAAGAACCCTGGAGAAGACCCAGCAGCAGCATATGTTATACCA 

GCAGGAGCAGCACCATCAAATTCTCCAGCAACAAATTCAAGACTCTATCTGTCCTCCTCA 

GCCATCTCCACCTCTTCAGGCTGCATGTGAAAATCAGCCAGCCCTCCTTACCCATCAGCT 

CCAGAGGTTAAGGATTCAGCCTTCAAGCCCACCCCCCAACCACCCCAACAACCATCTCTT 

CAGGCAGCCCAGTAATAGTCCTCCCCCCATGAGCAGTGCCATGATCCAGCCTCACGGGGC 

TGCATCTTCTTCCCAGTTTCAAGGCTTACCTTCCCGCAGTGCAATCTTTCAGCAGCAACC 

TGAGAACTGTTCCTCTCCTCCCAACGTGGCACTAACCTGCTTGGGTATGCAGCAGCCTGC 

TCAGTCACAGCAGGTCACCATCCAAGTCCAAGAGCCTGTTGACATGCTCAGCAACATGCC 

AGGCACAGCTGCAGGCTCCAGTGGGCGCGGCATCTCCATCAGCCCCAGTGCTGGTCAGAT 

GCAGATGCAGCACCGTACCAACCTGATGGCCACCCTCAGCTATGGGCACCGTCCCTTGTC 

CAAGCAGCTGAGTGCTGACAGTGCAGAGGCTCACAGCTTGAACGTGAATCGGTTCTCCCC 

TGCTAACTACGACCAGGCGCATTTACACCCCCATCTGTTTTCGGACCAGTCCCGGGGTTC 

CCCCAGCAGCTACAGCCCTTCAACAGGAGTGGGGTTCTCTCCAACCCAAGCCCTGAAAGT 

CCCTCCACTTGACCAATTCCCCACCTTCCCTCCCAGTGCACATCAGCAGCCGCCACACTA 

TACCACGTCGGCACTACAGCAGGCCCTGCTGTCTCCCACGCCGCCAGACTATACAAGACA 

CCAGCAGGTACCCCACATCCTTCAAGGACTGCTTTCTCCCCGGCATTCGCTCACCGGCCA 

CTCGGACATCCGGCTGCCCCCAACAGAGTTTGCACAGCTCATTAAAAGGCAGCAGCAACA 

ACGGCAGCAGCAGCAGCAACAGCAGCAACAGCAAGAATACCAGGAACTGTTCAGGCACAT 

GAACCAAGGGGATGCGGGGAGTCTGGCTCCCAGCCTTGGGGGACAGAGCATGACAGAGCG 

CCAGGCTTTATCTTATCAAAATGCTGACTCTTATCACCATCACACCAGCCCCCAGCATCT 

GCTACAAATCAGGGCACAAGAATGTGTCTCACAGGCTTCCTCACCCACCCCGCCCCACGG 

GTATGCTCACCAGCCGGCACTGATGCATTCAGAGAGCATGGAGGAGGACTGCTCGTGTGA 

GGGGGCCAAGGATGGCTTCCAAGACAGTAAGAGTTCAAGTACATTGACCAAAGGTTGCCA 

TGACAGCCCTCTGCTCTTGAGTACCGGTGGACCTGGGGACCCTGAATCTTTGCTAGGAAC 

TGTGAGTCATGCCCAAGAATTGGGGATACATCCCTATGGTCATCAGCCAACTGCTGCATT 

CAGTAAAAATAAGGTGCCCAGCAGAGAGCCTGTCATAGGGAACTGCATGGATAGAAGTTC 

TCCAGGACAAGCAGTGGAGCTGCCGGATCACAATGGGCTCGGGTACCCAGCACGCCCCTC 

CGTCCATGAGCACCACAGGCCCCGGGCCCTCCAGAGACACCACACGATCCAGAACAGCGA 

CGATGCTTATGTACAGCTGGATAACTTGCCAGGAATGAGTCTCGTGGCTGGGAAAGCACT 

TAGCTCTGCCCGGATGTCGGATGCAGTTCTCAGTCAGTCTTCGCTCATGGGCAGCCAGCA 

GTTTCAGGATGGGGAAAATGAGGAATGTGGGGCAAGCCTGGGAGGTCATGAGCACCCAGA 

CCTGAGTGATGGCAGCCAGCATTTAAACTCCTCTTGCTATCCATCTACGTGTATTACAGA 

CATTCTGCTCAGCTACAAGCACCCCGAAGTCTCCTTCAGCATGGAGCAGGCAGGCGTGTA 
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FIGURE 2X 

ACAAGAAACAGAGAGTTTTGTGTACAGCTTGGGAATGAAAAGGTTGATTGTAAACCCACA 
GTATCTAGCAGCGTTGTGCCAAATTGCCCTTGTGTTTCTCTCCACCCAAAATATCACAGC 
TGCTTTCCTCACATTTGGTTCATCCGTGTGCTGTTCTTTTGGGTTCTGAGAGGGTTTTGC 
CATGTTTGCTTGTATGACCAAGTCACCAAGGAAATAAACAGGAAGGAAATCCATGTTCTC 



SEQ ID NO: 33_2R22-5-ll_H 

CTGGGCCGCTGCCGGTCAGGTCGGCCGCCCCTGACAGCTCCGGGAGCCTCAAGCGCGACA 

GGGCGCCCTCACCTCGGGACATCCACACACCGACCGCTCCTGCTCCAGAGGCAACAACCC 

AGCGCGCCTAGCCTGGCGCCGTGCAGCGAAGCCCAAGAGCTGGCCTCGCCACGAAGGTTG 

AACCAGCCAAATTTTCGAGACAGCTCACGGCTTAGAGGAAGGTTCATCTAAATAAAGGCC 

GGCTAAAGTGACATTGCAGGGATTAAATCCTTCTTTGGCTGCCTGTGTGACCAGAAGGCT 

TATTTGCAAGTTTCTTCTTTCCTGGGGTCCAGATTATTAGGTCTCCAGCGCCCTGCAGCT 

TGACAGAAAGAGAAGCATGAAATGAAGGTCAGAGATGAGATCCCGCAGCAGGGACGTGGG 

GGCCTCCCAGGGGCATTTACGCACCAGAGTGCAAGATTCTCTGGCCATCAAGGGAAATAG 

CAAACAGAAGCCTTTGTCCTGGGGCACAGCCACCTACCACAAAGCATCAGACTCCACGTC 

TGGCCAGAAAGTTCCTGGAGTCCCATCAGGCCAGTGGGTATGTAACATGTGCCTAATTGT 

ACAGCTAGAGCCTGCAAGTTCAACGTGAGGGAAGGTGGGAAATGTCTTGAGTGAGGCGAG 

CAGCTCCTGGCTGGGCTGGGCAGACTCAGCTACCACGTTCACTGCCTTCCTCTCACTAAA 

GCCGAGAGGGAGGCTGCTCAGCTCTCAGGAAAACTCTTTTGAACCCTGGGCACCTGCTGT 

CCTCAGTTGGCATCTCCCACCCTCTGAGCCTCTTCTGCTCCTGCACAACCTGCCTCTTCG 

CTGAGATGGAGACGTGAGCCCCCGTGGACGATGACTGCAGTGTATATGAATGGAGGTGGC 

CTGGTGAACCCCCACTATGCCCGGTGGGATCGGCGCGACAGTGTAGAAAGTGGCTGTCAG 

ACCGAGAGTAGCAAGGAGGGTGAGGAGGGACAGCCCCGCCAGCTGACGCCCTTCGAGAAA 

CTGACACAGGACATGTCCCAGGATGAGAAGGTGGTGAGGGAGATCACGCTGGGGAAACGG 

ATAGGCTTCTACCGAATTCGAGGGGAAATCGGAAGTGGAAACTTCTCCCAAGTGAAGCTT 

GGGATTCACTCCCTAACCAAAGAAAAGGTGGCCATTAAGATCCTGGACAAGACCAAGTTA 

GACCAGAAAACCCAGAGGCTACTATCCCGAGAAATCTCCAGCATGGAAAAGCTGCACCAT 

CCCAACATCATCCGCCTTTACGAAGTGGTGGAGACCCTATCCAAGCTGCACTTGGTGATG 

GAGTATGCAGGGGGTGGGGAGCTCTTCGGAAAAATTAGCACTGAGGGGAAGCTCTCTGAA 

CCAGAAAGCAAGCTCATCTTCTCCCAGATTGTGTCTGCCGTGAAGCACATGCATGAAAAC 

CAAATTATTCATAGAGATCTGAAAGCAGAAAATGTATTCTATACCAGTAATACTTGTGTG 

AAGGTGGGCGATTTTGGATTCAGCACAGTAAGCAAAAAAGGTGAAATGCTGAACACTTTC 

TGTGGGTCTCCTCCCTACGCTGCGCCTGAACTCTTCCGGGACGAGCACTACATCGGCATT 

TACGTGGATATCTGGGCCTTGGGGGTGCTTTTGTACTTCATGGTGACTGGCACCATGCCA 

TTTCGGGCAGAAACCGTGGCCAAACTAAAAAAGAGCATCCTCGAGGGCACATACAGTGTA 

CCGCCGCACGTGTCAGAGCCCTGCCACCGACTCATCCGAGGAGTCCTTCAGCAGATCCCC 

ACGGAGAGGTACGGAATCGACTGCATCATGAATGATGAATGGATGCAAGGGGTGCCATAC 

CCTACACCTTTGGAACCTTTCCAACTGGATCCCAAACATTTGTCGGAAACCAGCACTCTC 

AAGGAAGAAGAAAATGAGGTCAAAAGCACTTTAGAACATTTGGGCATTACAGAAGAGCAT 

ATTCGAAATAACCAAGGGAGAGATGCTCGCAGCTCAATCACAGGGGTCTATAGAATTATT 

TTACATAGAGTCCAAAGGAAGAAGGCTTTGGAAAGTGTCCCAGTCATGATGCTACCAGAC 

CCTAAAGAAAGAGACCTCAAAAAAGGGTCCCGTGTCTACAGAGGGATAAGACACACATCC 

AAATTTTGCTCGATTTTATAAATTGCACTAGACTGCTTGTAACTAACCAAGATGATTGTT 

GCTGCTTCTAAATTTTTTTCAAGGACAACTTGAGTGGAGACATTTTTGTAATTTTTAAAT 

AAACTTAAATTTGAGATATGCAAAAAAAAAA 
SEO ID NO: 34 R31237_1_H, AAC33487 

ATGTCCACTAGGACCCCATTGCCAACGGTGAATGAACGAGACACTGAAAACCACACGTCA 
CATGGAGATGGGCGTCAAGAAGTTACCTCTCGTACCAGCCGCTCAGGAGCTCGGTGTAGA 
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FIGURE 2Y 

AACTCTATAGCCTCCTGTGCAGATGAACAACCTCACATCGGAAACTACAGACTGTTGAAA 

ACAATCGGCAAGGGGAATTTTGCAAAAGTAAAATTGGCAAGACATATCCTTACAGGCAGA 

GAGGTTGCAATAAAAATAATTGACAAAACTCAGTTGAATCCAACAAGTCTACAAAAGCTC 

TTCAGAGAAGTAAGAATAATGAAGATTTTAAATCATCCCAATATAGTGAAGTTATTCGAA 

GTCATTGAAACTGAAAAAACACTCTACCTAATCATGGAATATGCAAGTGGAGGTGAAGTA 

TTTGACTATTTGGTTGCACATGGCAGGATGAAGGAAAAAGAAGCAAGATCTAAATTTAGA 

CAGATTGTGTCTGCAGTTCAATACTGCCATCAGAAACGGATCGTACATCGAGACCTCAAG 

GCTGAAAATCTATTGTTAGATGCCGATATGAACATTAAAATAGCAGATTTCGGTTTTAGC 

AATGAATTTACTGTTGGCGGTAAACTCGACACGTTTTGTGGCAGTCCTCCATACGCAGCA 

CCTGAGCTCTTCCAGGGCAAGAAATATGACGGGCCAGAAGTGGATGTGTGGAGTCTGGGG 

GTCATTTTATACACACTAGTCAGTGGCTCACTTCCCTTTGATGGGCAAAACCTAAAGGAA 

CTGAGAGAGAGAGTATTAAGAGGGAAATACAGAATTCCCTTCTACATGTCTACAGACTGT 

GAAAACCTTCTCAAACGTTTCCTGGTGCTAAATCCAATTAAACGCGGCACTCTAGAGCAA 

ATCATGAAGGACAGGTGGATCAATGCAGGGCATGAAGAAGATGAACTCAAACCATTTGTT 

GAACCAGAGCTAGACATCTCAGACCAAAAAAGAATAGATATTATGGTGGGAATGGGATAT 

TCACAAGAAGAAATTCAAGAATCTCTTAGTAAGATGAAATACGATGAAATCACAGCTACA 

TATTTGTTATTGGGGAGAAAATCTTCAGAGCTGGATGCTAGTGATTCCAGTTCTAGCAGC 

AATCTTTCACTTGCTAAGGTTAGGCCGAGCAGTGATCTCAACAACAGTACTGGCCAGTCT 

CCTCACCACAAAGTGCAGAGAAGTGTTTCTTCAAGCCAAAAGCAAAGACGCTACAGTGAC 

CATGCTGGACCAGCTATTCCTTCTGTTGTGGCGTATCCGAAAAGGAGTCAGACAAGCACT 

GCAGATGGTGACCTCAAAGAAGATGGAATTTCCTCCCGGAAATCAAGTGGCAGTGCTGTT 

GGAGGAAAGGGAATTGCTCCAGCCAGTCCCATGCTTGGGAATGCAAGTAATCCTAATAAG 

GCGGATATTCCTGAACGCAAGAAAAGCTCCACTGTCCCTAGTAGTAACACAGCATCTGGT 

GGAATGACACGACGAAATACTTATGTTTGCAGTGAGAGAACTACAGCTGATAGACACTCA 

GTGATTCAGAATGGCAAAGAAAACAGCACTATTCCTGATCAGAGAACTCCAGTTGCTTCA 

ACACACAGTATCAGTAGTGCAGCCACCCCAGATCGAATCCGCTTCCCAAGAGGCACTGCC 

AGTCGTAGCACTTTCCACGGCCAGCCCCGGGAACGGCGAACCGCAACATATAATGGCCCT 

CCTGCCTCTCCCAGCCTGTCCCATGAAGCCACACCATTGTCCCAGACTCGAAGCCGAGGC 

TCCACTAATCTCTTTAGTAAATTAACTTCAAAACTCACAAGGAGTCGCAATGTATCTGCT 

GAGCAAAAAGATGAAAACAAAGAAGCAAAGCCTCGATCCCTACGCTTCACCTGGAGCATG 

AAAACCACTAGTTCAATGGATCCCGGGGACATGATGCGGGAAATCCGCAAAGTGTTGGAC 

GCCAATAACTGCGACTATGAGCAGAGGGAGCGCTTCTTGCTCTTCTGCGTCCACGGAGAT 

GGGCACGCGGAGAACCTCGTGCAGTGGGAAATGGAAGTGTGCAAGCTGCCAAGACTGTCT 

CTGAACGGGGTCCGGTTTAAGCGGATATCGGGGACATCCATAGCCTTCAAAAATATTGCT 

TCCAAAATTGCCAATGAGCTAAAGCTGTAA 
SEQ ID NO: 35 W90839_M 

AAAGGGCCGTCCTGGTCCAGCCGTTCCCTGGGTGCCCGTTGCCGGAACTCTATCGCTTCC 
TGCCCTGAGGAACAACCCCATGTGGGCAACTATAGGCTGCTAAGGACCATCGGGAAGGGC 
AACTTCGCCAAAGTCAAGCTGGCTCGGCATATCCTCACGGGCCGGGAGGTCGCTATTAAG 
ATCATTGATAAGACCCAGCTGAACCCCAGTAGCTTGCAGAAGCTGTTCAGAGAAGTCCGA 
ATTATGAAGGGACTCAACCACCCCAACATCGTGAAGCTTTTTGAGGTGATAGAGACGGAG 
AAGACGCTATACCTGGTGATGGAATACGCTAGCGCAGGAGAAGTGTTTGACTACCTCGTG 
TCGCACGGCCGCATGAAGGAGAAGGAGGCTCGAGCCAAGTTCCGGCAGATCGTGTCAGCC 
GTGCACTACTGTCATCAGAAGAACATTGTACACAGGGATCTAAAGGCTGAAAACCTGTTG 
CTGGATGCCGAGGCCAACATCAAAATCGCCGACTTCGGCTTCAGCAATGAGTTCACGCTG 
GGCTCCAAGCTGGACACCTTCTGTGGGAGCCCCCCATACGCCGCCCCAGAGCTGTTCCAG 
GGCAAGAAGTATGATGGGCCAGAGGTGGACATCTGGAGCCTGGGTGTCATCCTGTACACG 
CTGGTCAGCGGCTCCCTGCCCTTCGATGGGCACAACCTCAAGGAGCTGCGGGAGCGAGTC 
CTCAGAGGAAAGTACCGGGTCCCCTTCTACATGTCTACAGACTGCGAGAGCATTCTGCGG 



Silo 
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FIGURE 2Z 

AGATTTCTGGTGCTGAACCCCGCAAAACGCTGTACTCTGGAGCAAATCATGAAAGACAAA 

TGGATCAACATCGGCTATGAGGGTGAGGAGCTGAAGCCAGACACGGAGCTCAAAGAAGAG 

CGGATGCCGGGTCGGAAAGCGAGCTGCAGTGCAGTGGGCAGTGGAAGTCGAGGCTTGCCC 

CCCTCCAGCCCCATGGTCAGCAGTGCCCACAACCCCAATAAGGCAGAGATCCCTGAGCGG 

CGGAAGGACAGCACTAGCACCCCTAACAACCTCCCCCCCAGCATGATGACCCGAAGAAAC 

ACCTATGTGTGCACAGAGCGACCAGGATCTGAACGCCCGTCCTTGTTGCCAAATGGCAAA 

GAAAATAGCTCCGGTACCTCGCGGGTGCCCCCTGCCTCGCCTTCCAGTCATAGCCTGGCT 

CCCCCGTCAGGCGAGCGGAGCCGCCTGGCTCGGGGCTCCACCATCCGCAGCACCTTCCAT 

GGGGGCCAGGTCCGAGACCGGCGGGCAGGGAGCGGGAGTGGCGGGGGTGTGCAGAATGGA 

CCCCCAGCCTCACCCACGCTTGCCCACGAGGCCGCACCCCTGCCCTCCGGGCGGCCTCGC 

CCCACCACCAACCTCTTCACCAAGCTGACCTCCAAACTGACCCGAAGGGTCACAGACGAA 

CCTGAGAGAATCGGGGGACCTGAGGTCACAAGTTGCCATCTACCTTGGGATAAAACGGAA 

ACCGCCCCCAGGCTGCTCCGATTCCCCTGGAGTGTGAAGCTGACCAGCTCGCGACCTTCC 

TGAGGCCCTGATGGCTGCCCTGCGACAGGCCACA 
qpn ID NO- 36 406786.5 H 

GTAGCCGGCTTGGCGTGACCGTCGCCTGATCCAGTTGTTAGAGGTGGAAGCTTGGCAGTT 
GGCCTCCCTTCTTCCCATGGAGGTCGGGGGCTTAACAGTCTTTGAAGAGGACCAGAGATG 
CCTTTCCCAGAGCCTCCCCTTGCCAGTGTCAGCAGAGGGCCCAGCTGCACAGACCACTGC 
TGAGCCCAGCAGGTCGTTTTCCTCAGCCCACAGACACCTGAGCAGAAGGAATGGGCTTTC 

Sgactctgccagagcaggacggcgctctctgaagacagatggagctcctattgtctatc 

ATCACTGGCTGCCCAGAATATTTGTACAAGTAAACTGCACTGCCCTGCTGCCCCTGAGCA 

cacggacccgtccgaaccgcggggcagtgtgtcctgctgctccctgctgcggggactgtc 

CTCAGGGTGGTCCTCACCTCTGCTTCCGGCCCCTGTGTGCAACCCTAACAAGGCCATCTT 

cacggtggatgccaagaccacagagatcctcgttgctaacgacaaagcttgcgggctcct 

GGGGTACAGCAGCCAGGACCTGATTGGCCAGAAGCTCACGCAGTTCTTTCTGAGGTCAGA 
TTCTGATGTGGTGGAGGCCCTCAGCGAGGAGCACATGGAGGCCGACGGCCACGCTGCGGT 
GGTGTTTGGCACGGTGGTGGACATCATCACCCGTAGTGGGGAGAAGATTCCAGTGTCTGT 

SggSgSgLgatgcggcaggagcgccgcctatgctgcgtggtggtcctggag 

GGAGAGGGTCTCGACCTGGGTCGCTTTCCAGAGCGATGGCACCATCACGTCATGTGACAG 
TrTCTTTGCTCATCTTCACGGGTACGTGTCTGGGGAGGACGTGGCTGGGCAGCATATCAC 
AGACCTGATCCCTTCTGTGCAGCTCCCTCCTTCTGGCCAGCACATCCCAAAGAATCTCAA 
GATTCAGAGGTCTGTTGGAAGAGCCAGGGACGGTACCACCTTCCCTCTGAGCTTAAAGCT 
GAAATCCC^CCCAGCAGCGAGGAGGCGACCACCGGTGAGGCGGCCCCTGTGAGCGGCTA 
CC^GCATCTGTCTGGGTGTTCTGCACCATCAGTGGCCTCATCACCCTCCTGCCGGATGG 
GACCATCCACGGCATCAACCACAGCTTCGCGCTGACACTGTTTGGTTACGGAAAGACGGA 
GCTCCTGGGCAAGAATATCACTTTCCTGATTCCTGGTTTCTACAGCTACATGGACCTTGC 
GTACAACAGCTCATTACAGCTCCCAGACCTGGCCAGCTGCCTGGACGTCGGCAATGAGAG 
TGGGTGTGGGGAGAGAACCTTGGACCCGTGGCAGGGCCAGGACCCAGCTGAGGGGGGCCA 
GGATCCAAGGATTAATGTCGTGCTTGCTGGTGGCCACGTTGTGCCCCGAGATGAGATCCG 
GAAGCTGATGGAAAGCCAAGACATCTTCACCGGGACTCAGACTGAGCTGATTGCTGGAGG 
rrAGCTCCTTTCCTGCCTCTCACCTCAGCCTGCTCCAGGGGTGGACAATGTCCCAGAAGG 

Sg?ctgccagtgcacggtgaacaggcgctgcccaaggaccagcaaatcactgccttggg 

GAGAGAGGAACCTGTGGCAATAGAGAGCCCCGGACAGGATCTTCTGGGAGAAAGCAGGTC 

^gSccagtcgatgtgaagccatttgcttcctgcgaagattctgaagctccagtccca^ 

TGAGGATGGGGGCAGTGATGCTGGCATGTGTGGCCTGTGTCAGAAGGCCCAGCTAGAGCG 

gSgggagtcagtggtcccagcggttcagacctttgggctggggctgccgtggccaagcc 
SaSccaaggg 

tcaatggggcttgtggtggcgaagccaggacttggcccccagcccctctgggatggcagg 
cctctcgtttgggacacctactctagatgagccgtggctgggagtggaaaacgaccgaga 
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FIGURE 2AA 

AGAGCTGCAGACCTGCTTGATTAAGGAGCAGCTC 

GGATGTCCCCCACGCCGAAC^^ 
GTCCTGCGATCTGGGAGGCAGAGACCTGTGCGGTG^ 

CTATGCCTTGGCCACGGACCTCCCTGGGGGCCTGG^ 

TGTGAATTCGTTTTCCTGGAACCTC^GGAACT^ 

GTCATCAAATTGTTCCTGTGCTACGTCTGA 

GGGCTCCGATCCAGATGTAGGCAGTCTCC^ 

GGAGCTGTTACTACTGACCGGCACCTGTG^GACCll GGTGT CCTCTGAGCA 
GAGCTGTGTGGGACATGATCCAACAGA^CCGCT^ 

TTATGCAGCAAGCGACAGAGAAAGCCCAGGACACGllu ACC TCCACGCC 

tSggacacgtgccc*tcagcaga^ 

CGTGATCGTGATGCGCGGGGCTGCTGGCCTG^ 
CGGGAGCTGCTACCATCGAGATGGCTTACGGCTGAGTA^ 

GTTCTTCCAGCTTGTGATGGAGA^GCACGGC^ 
CCGCCACCCCAGGCTGGATGAGCCC^ 

CCAGAGCCGTCTAGTGTCAGCAGTGGGATAUCl CAAGCTGATAGAC TTTGG 

CATCAAGGATGAGAACATCGTGATCGCTGAGGA^ 

CTCGGCCGCCTACTTGGAAAGGGGAAAATTATTTTATAC^ 

CTGTGCACCGGAAGTTCTCATGGGGAAT^ 
TCTGGGAGTCACTCTGTACACGCTGGT^^ 

GACCGTGGAGGCTGCCATACACCCGCCATA^l TTGGAGAAG CTGGTGACAGA 
GTCTGGGCTGCTGCAGCCAGTCCCTGAGAGACGC^ 
CCCGTGGGTAACACAGCCTGTGA^TCTTGCTGA^ 
AAACAAGCCAGAAAGTGGAGTTCTGTCCGCTGCG^AGCC^ 

GAGTGATGTGGCCCAGGCTCAGGAGCTTTGTG^ 

SEQ ID NO: 37_AA544838 M ^06786 TTCATCTTTCG ACAACTAG 
CCACGCGTCCGCATCCCTGCTTGGATGAGCCCCTGGCGA 

TGTCTGCTGTAGGATACCTGCACTCCCAAGGCATCATCC^ 

SLtgtgattgctgaggact^ 

TAGAGAGGGGCAAACTATTTTATACCTTTTGTGGAA TCTCTGGGGGT CACCC 
TTCTCATTGGAAATCCCTACAGAGGGCCAGAGCTGG^ 

TGTACACGCTCATCTTCGAGGAGAATCCCTOTGTGA TCTGTC TGGACTGCTGC 
TTATTCATCCCCCATTCCTG^ 

AGCCTTGCCCTGAGCAGCGGACCACTTTGGA^A^ ^ 

AGCCTGTGAACCTTGCTAGCTATACTTG^ 

GTGGCCTGCTGTCAGCTGCA^GTCT^ 

AGAGAGAGGGTCTCTGTGGGCCTCCTGCTCCCA ^ OTCCTC cTCTTTC3t5TTTG 



55/19 
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FIGURE 2BB 

TGACTTCTGCTTCTAGATTCCTATGCAAATG^ 

AGGTTTATGATATAGACTTG^ 
TTCAGTTATACTGTACTATTA^GGGCTTTAAGTTC^ 

TGTTTAAGTTCTCTATTTTTGTTGT^ 
ACCATGTAACTTTGGCTGGCCTGGAACTC^ 

S^CTTGGTTITCTATTTCTTTTTGT 
gMCGOMCQCMTTCOOCa^^ 



CTTGCAGCKGGGCCGGTCCGGGTGGGOTCTACGACA ^ MTGGCAM *AA 
CAACTTCGCTGTGGTGAAGCTGGG^^ 

AAGGATGTTGGTCCTAGACCCATCCAAACGGCTAACCATA^ 
ATGGATGCTCATAGAAGTTCCTGTCCAGAGACCTGTTCTCTA^ 
TGAGCCATCCATCGGGGAGTTTAATGAGCA^GGTTCT 

AGATCAGCAGAAARCCATTGAGTCTTTGCAGAALAA CCCAGTGGAGC AGAG 

TTATTTCTTGTTGGTGGAGCGCCTGAAATCAC^TCGGA 

ACTTGATGGCCGCCAGCGTCGGCCTAGCACCATTG^ 

gaSgtggggctcccagtgaccat^ 

CCTCCCCCAGGCATCCAACGTGGAGGCCTTTTCATTT^ ^ggctGTCTGCTTGA 

AGCTGCATTCATGGAAGAAGAGTGTGTGGACA^ 

CCCTGTGCCTCCTGTCCTGGTGCGGA^ 
GACCTCCATTGACGAAGGGCTGGAGACAGA^GGAG^ 

CTTTGAGGCATTTCAGTCCACACGCAGCGGGCAGAGACO GAC AGCCCCTC 

SScaactggtcgtgatgcctggg^^ 
ccttgacagtgtggactctgagtatgatatc 

GGAAGACAACCCTTCCCTTAAGGACATCATG^ 

atctcccttcataagcctgagacctacc^cccmccat GCAGAGCA tcagatac 

ACGAGAGGTCCACAACAGGTCTCCAGTGAGCTTCAGAGA 

C^CCCTCACCCAGGGAATTGTAGCATTTAG^ 

AGGAATTCTAGAGTTGAACAAAGTGC^ 
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FIGURE 2CC 

GCAGCAGCTGCAGGAACATAGGCTCCAGCAGAAGCGACTCTTTCTTCAGAAGCAGTCTCA 
RPTGCAGGCCTATTTTAATCAGATGCAGATAGCAGAGAGCTCCTACCCACAGCCAAGTCA 
GCAGCTGCCCCTTCCCCGCCAGGAGACTCCACCGCCTTCTCAGCAGGCCCCACCGTTCAG 
CCTGACCCAGCCCCTGAGCCCCGTCCTGGAGCCTTCCTCCGAGCAGATGCAATACAGCCC 
TTTCCTCAGCCAGTACCAAGAGATGCAGCTTCAGCCCCTGCCCTCCACTTCCGGTCCCCG 

TGAGCTGCCAAGCGCTGCTTCCCCTGCGCCAGACTATCCCACTCCCTGTCAGTATCCTGT 
GGATGGAGCCCAGCAGAGCGACCTAACGGGGCCAGACTGTCCCAGAAGCCCAGGACTGCA 

agaggSSScagctacgacccactagccctctc^ 

TGAAATGCTAGACGCTGTGGATCCACAACACAACGGGTATGTCCTGGTGAATTAGTCTCA 
GCACAGGAATTGAGGTGGGTCAGGTGAAGGAAGAGTGTATGTTCCTATTTTTATTCCAGC 
CTTTTAAATTTAAAGCTTATTTTCTTGCCCTCTCCCTAACGGGGAGAAATCGAGCCACCC 
AACTGGAATCAGAGGGTCTGGCTGGGGTGGATGTTGCTTCCTCCTGGTTCTGCCCCACCA 
CAAAGTTTTCTGTGGCAAGTGCTGGAACATAGTTGTAGGCTGAGGCAGGAGAATGGCGTG 
ScCCGGGAGGCGGAGCTTGCAGTGAGCCAAGATCGTGCCACTGCACTCCAGCCTGGGCG 
ATTCAGCAAGACTCCACCTCAAAAAAAAAAAAAAAGGACAAGAGCAGTATCATCTGCCTC 

tg^Sa^ctggacaaagagattttcttaaagtttctatcatctcccttctgacaggt 

TCTACAGTGTGGTCTGAAGCACCTGTAATGTCAGAGCCCTTGTCTGGCCCTTGGTGGCAG 
GTGAACGAAAGCAGTGGAGCCTCTCACCTTCCAGTAGCCTCTCACATTCTTATTTTACCA 

?™gtcctaattaaggtagcctagctgattctagaagacagccatcctacgtgcaccc 

CCACCTTGTGTCCACATCTTCTCCAGGCAGGTTTCAACCTATCAGCAGACTCAGGCACAC 

actggggcacagatagagaaccaggcggcagcagtgctcgcagacccacccagggagagc 

TGTGATGGGTTCTGCCCAGATACTCTGCTCGCCCACCCACAAGGGAGCAATAGCTTATAT 

^gScattagttttaccaagcactttctcttctaaccctcacaacaattctatg^ 
IgSggggagatactgtccttatttttcacagctgaagaaaccaaagctttgggaagttt 

"tctctgagatcacagctggtgatagaaggagctg^ 
tggcttctggttttggttctctggcttctagtgctggaagaagccctctctttcccttct 

ctttcctcagtagcatctgactcttttcataagcaaacagctgtataaacaaagccccca 

SSggtcaagcacagggtgaatgtgatattgttcccacaaccttattctccactcaa^ 

gSgcctggctttggggaagaggccgccttcaggtgacagtgcagctgtccaggtggccg 

?gcactgaaccaggctgagggagacaaaaaccccgcagacccgcctgcctttcagcgtcc 

IgSaI??gcagaagtttaggctcacctcaaagatgtctagtttttccaagttacaatac 

agcagtttcctacagaacacccccttcctcaattgccaaggggccgcatcgcacggcatc. 

agSSccactgcaggccagcagattcgaccccaggaacggtcatgaactc 

c?cScgaggggcgtaacatttccttacagtcaagccccatc^ 
ctt^aggattaaaaaagtaataacagactttgacttaatactctgtcttttcagaggca 

SSgggtgggtagaggggagctttaaaaatagaagtacaaaacaacatcctgg^ 

a^gI?cccLatggaataatgtcacattccccagtgcagataatgggctgctgct^ 

?gtggtgtctgtctgcagaagatttgctcagtcaaggaaattcaagtggtgagacctttc 

SccLgggtggtaagagaaacctgccttcaccaaaatctctgaaggggaaagaact 

gagaaaggtttgcttcacttcggggactgcagtttgagaaataaaagggatacagagata 

?c?^ctttgtagaaagggcaagattatttgcttatatctgaagggaggtgggtggttt 

toctggatgtttggtctgaaagagttacttttgataaagttaatctaattgtagttatat 

TTTCTGTGTGCTTTTTTTTAATTACTAAGAAAAAAATTGGTGAGTTCAGTAGCTTTGGTA 

SItgagtgcaaatcataatagctccaatgtgaaaaaaaaaatca^ 

ACTTAATGTTAGAAAATTGCCTAAAATGCAGTGTAATAAATAATCTCTGTACCA^ 

a^S^tggggtaattttctgcaaggaaaatgtactgtttttatgtttccaaccctct 

TGA 
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FIGURE 2DD 

SEQ ID NO: 39_AA207220_H 

GCTGTGGCTCCCCGTCCTGGTGCGGGACCTGTGCCCCGCGCTTCAGCCCTCCCCGCAAGC 

CTATTGATTCCCCTGCCGCCCTTGCTCCACCTCCTGCTCGCCATGGAGTCGCTGGTTTTC 

GCGCGGCGCTCCGGCCCCACTCCCTCGGCCGCAGAGCTAGCCCGGCCGCTGGCGGAAGGG 

CTGATCAAGTCGCCCAAGCCCCTAATGAAGAAGCAGGCGGTGAAGCGGCACCACCACAAG 

CACAACCTGCGGCACCGCTACGAGTTCCTGGAGACCCTGGGCAAAGGCACCTACGGGAAG 

GTGAAGAAGGCGCGGGAGAGCTCGGGGCGCCTGGTGGCCATCAAGTCAATCCGGAAGGAC 

AAAATCAAAGATGAGCAAGATCTGATGCACATACGGAGGGAGATTGAGATCATGTCATCA 

CTCAACCACCCTCACATCATTGCCATCCATGAAGTGTTTGAGAACAGCAGCAAGATCGTG 

ATCGTCATGGAGTATGCCAGCCGGGGCGACCTTTATGACTACATCAGCGAGCGGCAGCAG 

CTCAGTGAGCGCGAAGCTAGGCATTTCTTCCGGCAGATCGTCTCTGCCGTGCACTATTGC 

CATCAGAACAGAGTTGTCCACCGAGATCTCAAGCTGGAGAACATCCTCTTGGATGCCAAT 

GGGAATATCAAGATTGCTGACTTCGGCCTCTCCAACCTCTACCATCAAGGCAAGTTCCTG 

CAGACATTCTGTGGGAGCCCCCTCTATGCCTCGCCAGAGATTGTCAATGGGAAGCCCTAC 

ACAGGCCCAGAGGTGGACAGCTGGTCCCTGGGTGTTCTCCTCTACATCCTGGTGCATGGC 

ACCATGCCCTTTGATGGGCATGACCATAAGATCCTAGTGAAACAGATCAGCAACGGGGCC 

TACCGGGAGCCACCTAAACCCTCTGATTGCCTGNNTGGCCTGATCCGGTGGCTGTTGATG 

GTGAACCCCACCCGCCGGGCCACCCTGGAGGATGTGGCCAGTCACTGGTGGGTCAACTGG 

GGCTACGCCACCCGAGTGGGAGAGCAGGAGGCTCCGCATGAGGGTGGGCACCCTGGCAGT 

GACTCTGCCCGCGCCTCCATGGCTGACTGGCTCCGGCGTTCCTCCCGCCCCCTCCTGGAG 

AATGGGGCCAAGGTGTGCAGCTTCTTCAAGCAGCATGCACCTGGTGGGGGAAGCACCACC 

CCTGGCCTGGAGCGCCAGCATTCGCTCAAGAAGTCCCGCAAGGAGAATGACATGGCCCAG 

TCTCTCCACAGTGACACGGCTGATGACACTGCCCATCGCCCTGGCAAGAGCAACCTCAAG 

CTGCCAAAGGGCATTCTCAAGAAGAAGGTGTCAGCCTCTGCAGAAGGGGTACAGGAGGAC 

CCTCCGGAGCTCAGCCCAATCCCTGCGAGCCCAGGGCAGGCTGCCCCCCTGCTCCCCAAG 

AAGGGCATTCTCAAGAAGCCCCGACAGCGCGAGTCTGGCTACTACTCCTCTCCCGAGCCC 

AGTGAATCTGGGGAGCTCTTGGACGCAGGCGACGTGTTTGTGAGTGGGGATCCCAAGGAG 

CAGAAGCCTCCGCAAGCTTCAGGGCTGCTCCTCCATCGCAAAGGCATCCTCAAACTCAAT 

GGCAAGTTCTCCCAGACAGCCTTGGAGCTCGCGGCCCCCACCACCTTCGGCTCCCTGGAT 

GAACTCGCCCCACCTCGCCCCCTGGCCCGGGCCAGCCGACCCTCAGGGGCTGTGAGCGAG 

GACAGCATCCTGTCCTCTGAGTCCTTTGACCAGCTGGACTTGCCTGAACGGCTCCCAGAG 

CCCCCACTGCGGGGCTGTGTGTCTGTGGACAACCTCACGGGGCTTGAGGAGCCCCCCTCA 

GAGGGCCCTGGAAGCTGCCTGAGGCGCTGGCGGCAGGATCCTTTGGGGGACAGCTGCTTT 

TCCCTGACAGACTGCCAGGAGGTGACAGCGACCTACCGACAGGCACTGAGGGTCTGCTCA 

AAGCTCACCTGAGTGGAGTAGGCATTGCCCCAGCCCGGTCAGGCTCTCAGATGCAGCTGG 

TTGCACCCCGAGGGGAGATGCCTTCTCCCCCACCTCCCAGGACCTGCATCCCAGCTCAGA 

AGGCTGAGAGGGTTTGCAGTGGAGCCCTGAGCAGGGCTGGATATGGGAAGTAGGCAAATG 

AAATGCGCCAAGGGTTCAGTGTCTGTCTTCAGCCCTGCTGAACGAAGAGGATACTAAAGA 

GAGGGGAACGGGAATGCCCGCGACAGAGTCCACATTGCCTGTTTCTTGTGTACATGGAGG 

GGCCACAGAGA 

SEO ID NO: 40 AA426580_H, MAK_V_H 

ATGCCGGCGGCGGCGGGGGACGGGCTCCTGGGGGAGCCGGCGGCGCCTGGGGGCGGCGGC 
GGCGCGGAGGACGCGGCCAGGCCCGCGGCGGCCTGCGAGGGAAGTTTCCTGCCTGCCTGG 
GTGAGCGGCGTGCCCCGCGAGCGGCTCCGCGACTTCCAGCACCACAAGCGCGTGGGCAAC 
TACCTCATCGGCAGCAGGAAGCTGGGCGAGGGCTCCTTTGCCAAGGTGCGCGAGGGGCTG 
CACGTGCTGACCGGGGAGAAGGTGGCCATAAAAGTCATTGATAAGAAGAGAGCCAAAAAG 
GACACCTATGTCACCAAAAACCTGCGGCGAGAGGGTCAGATCCAGCAGATGATCCGCCAC 
CCCAATATCACTCAGCTCCTTGATATTTTAGAAACGGAAAACAGCTACTACCTGGTCATG 
GAGCTGTGCCCTGGGGGCAACCTGATGCACAAGATCTATGAGAAGAAGCGGCTGGAGGAG 
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FIGURE 2EE 

TCCGAAGCCCGCAGATACATCCGACAGCTCATCTCTGCCGTAGAGCACCTGCACCGGGCC 

GGGGTGGTCCACAGAGACTTGAAGATAGAGAATTTGCTACTAGATGAAGACAATAATATC 

AAGCTGATTGACTTTGGTTTGAGCAACTGCGCAGGGATCCTGGGTTACTCGGATCCGTTC 

AGCACACAGTGTGGCAGCCCTGCCTACGCTGCACCTGAACTGCTCGCCAGGAAGAAATAC 

GGCCCCAAAATCGATGTCTGGTCCATAGGTGTGAACATGTATGCCATGTTGACCGGGACG 

CTGCCTTTCACGGTGGAGCCTTTCAGCCTGAGGGCTTTGTACCAGAAGATGGTAGACAAA 

GAAATGAACCCCCTCCCCACTCAGCTCTCCACAGGTGCCATCAGTTTCCTGCGCTCTCTC 

CTGGAACCGGATCCTGTGAAGAGGCCAAATATTCAGCAGGCACTGGCGAATCGCTGGCTT 

AATGAGAATTACACGGGCAAAGTGCCCTGTAATGTCACCTATCCCAACAGGATTTCTCTG 

GAAGATCTGAGCCCGAGCGTCGTGCTGCACATGACCGAGAAGCTGGGTTACAAGAACAGC 

GACGTGATCAACACTGTGCTCTCCAACCGCGCCTGCCACATCCTGGCCATCTACTTCCTC 

TTAAACAAGAAACTGGAGCGCTATTTGTCAGGGAAATCTGACATCCAGGACAGCCTCTGC 

TACAAGACCCGGCTCTACCAGATAGAAAAGTACAGGGCCCCCAAGGAGTCCTATGAGGCC 

TCTCTGGACACCTGGACACGAGATCTTGAATTCCATGCCGTGCAGGATAAAAAGCCCAAA 

GAACAAGAAAAAAGAGGGGATTTTCTTCATCGACCATTCTCCAAGAAGTTGGACAAGAAC 

CTGCCCTCGCACAAACAGCCCTCAGGCTCGCTTATGACACAGATTCAGAACACCAAAGCC 

CTCCTGAAGGACCGGAAGGCCTCCAAGTCCAGCTTCCCCGACAAAGATTCCTTTGGCTGC 

CGCAATATTTTCCGCAAAACCTCAGATTCCAATTGTGTGGCTTCTTCTTCCATGGAGTTC 

ATCCCCGTGCCACCGCCCAGGACCCCGAGGATTGTGAAGAAACCGGAGCCCCATCAGCCA 

GGGCCCGGAAGCACTGGCATCCCCCACAAGGAAGACCCCCTGATGCTGGACATGGTGCGC 

TCCTTCGAGTCTGTGGATCGCGACGACCACGTAGAAGTGCTGTCTCCCTCTCATCACTAC 

AGGATTCTGAACTCCCCGGTCAGCTTGGCTCGCAGAAATTCCAGCGAGAGGACGCTGTCC 

CCGGGTCTGCCATCCGGAAGCATGTCGCCTCTCCATACTCCTTTGCATCCAACTCTGGTC 

TCTTTTGCTCACGAAGATAAGAACAGCCCCCCAAAAGAGGAGGGCCTGTGTTGCCCACCT 

CCGGTTCCCAGCAATGGCCCCATGCAGCCTCTGGGGAGCCCCAATTGTGTGAAAAGCCGA 

GGCCGGTTCCCTATGATGGGCATCGGACAGATGTTAAGGAAGCGCCATCAGAGTCTGCAG 

CCATCTGCAGATAGGCCCCTGGAGGCCAGCCTGCCCCCACTGCAGCCCCTAGCCCCTGTG 

AACCTTGCCTTTGACATGGCCGATGGGGTCAAGACCCAGTGCTAA 

SEQ ID NO: 41_Z36720_H 

ATGGACACAAAGCTGAACATGCTGAACGAGAAGGTGGACCAGCTCCTGCACTTCCAAGAA 
GATGTCACAGAGAAGTTGCAGAGCATGTGCCGAGACATGGGCCACCTGGAGCGGGGCCTG 
CACAGGCTGGAGGCCTCCCGGGCACCGGGCCCGGGCGGGGCTGATGGGGTTCCCCACATT 
GACACCCAGGCTGGGTGGCCCGAGGTCCTGGAGCTGGTGAGGGCCATGCAGCAGGATGCG 
GCCCAGCACGGTGCCAGGCTGGAGGCCCTCTTCAGGATGGTGGCTGCGGTGGACAGGGCC 
ATCGCTTTGGTGGGGGCCACGTTCCAGAAATCAAAGGTGGCGGATTTCCTCATGCAGGGG 
CGTGTGCCCTGGAGGAGAGGCAGCCCAGGTGACAGCCCTGAGGAGTGGGTAAAAGAGGAG 
GAGGTCTGTTTCATGCCTCCAGTTCCCCCAGCTCCGGGGGCAGCAGGACAGAGCCTGCAG 
AAGGATAAGGGGGAGCTGTCTGCCGAGCAGGGGATCTGGGCCACATTGATGACGCTGGTG 
ATCATGGTGACAGCGGCAAATAAAGAGCGAGTGGAAGAAGAGGGAGGAAAACCAAAGCAT 
GTGCTGAGCACCAGTGGGGTGCAGTCTGATGCCAGGGAGCCTGGGGAAGAGAGCCAGAAG 
GCGGACGTGCTGGAGGGGACAGCGGAGAGGCTGCCCCCCATCAGAGCGTCAGGGCTGGGA 
GCTGACCCCGCCCAGGCAGTGGTCTCACCGGGCCAGGGAGATGGTGTTCCTGGCCCAGCC 
CAGGCATTCCCTGGCCACCTGCCCCTGCCCACAAAGGTGGAAGCCAAGGCTCCTGAGACA 
CCCAGCGAGAACCTCAGGACTGGCCTGGAATTGGCTCCAGCACCCGGCAGGGTCAATGTG 
GTCTCCCCGAGCCTGGAGGTTGCACCAGGTGCAGGACAAGGAGCATCGTCCAGCAGGCCT 
GACCCTGAGCCCTTAGAGGAAGGCACGAGGCTGACTCCAGGGCCTGGCCCTCAGTGCCCA 
GGGCCTCCAGGGCTGCCAGCCCAGGCCAGGGCAACCCACAGTGGTGGAGAAACACCTCCA 
AGGGCAGCCCTGCTGAAGGGCGCTGTGGCCCCGGGCTTCTCTCGGAGGGACCTGGTGTTT 
CCTAGCATCTTCTGCGCCTGCCTAGGGATCTCCATCCACATACAAGAGATGGATACTCCT 
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FIGURE 2FF 

~==== 

acgaSSgcttggccaggagtgacgacaatgaccacgaggttggggccctgggcctg 
?SSggcaaaagcccaggggcgggaaaccctgagcctgagcaggactgtgcagccagg 

g??ctggatgacagtcg(3Gccccaccagctccttttgaacaccgggtagtgagcgtcaag 
gIgaotccatctctgcgggttacgaggtgtgccagcacgaagtcttgggagggggtcgg 
???ggccaggtccacaggtgcacagagaagtccac^ggcctcccactggctgccaagatc 
Scaaagtgaagagcgccaaggaccgggaggacgtgaagaacgagatcaacatcatgaac 
cIgc^agccacgtgaacctgatccagctctatgacgccttcgagagcaagcacagctgc 
acccttgtcatggagtacgtggacgggggtgagctcttcgaccggatcacagatgaga^g 
?accacctgactgagctggatgtggtcctgttc^ccaggcagatctgtgagggtgtgca^ 
Scctgcaccagcactacatcctgcagctggacctcaagccggagaacatattgtgcgtc 

c^IgagaIgctgaaggtgaacttcggcactcgtgagttcctggccccagaagtcgtcaat 
tatgagtttgtctcattccccacagacatgtggagtgtgggagtcatcacctacatgcta 

ScAGTGGCTTGTCCCCATTTCTAGGGGAAACAGATGGAGAGACCATGAATTTCATTGTA 

SSSagctgggattttgatgctgacacctttgaagggctctcggaggagggcaagga^ 
™tttcccggttgctggtcaaagagaagagctgcagaatgagtgccacacagtgcctg 
aaacacgagtggctgaataatttgcctgccaaagcttcaagatccaaaactcgtctcaaa 

^ca!c?ac?gctgcagaaatacatagctcaaagaaaatgg^ 
gtgactgctgccaacaggttaaggaaatttccaacttctccctaa 

qph TD NO- 4 2 SGK088 H 

ggggagatggcgc^gtttgagtgcctggtggcggggcccactgacgtggaggtggattgg 
ctc?g?cg?ggccgcctgctgcagc 

cgcaaatgcaagctgctacttacatctgtacatgaggacgacagtggcgtctacacctgc 

Sgc^Sca^gSccaaagatgaggtgacgtgcagtgcccgggtgaccgtggggcc 

^ggcacccctgttcacacggctgctggaagatgtggaggtgttggagggccgagctgcc 

ScSStggaggagagtgagaacttgcggctgcggcaggacgggggtctgcactcactg 
cacatocccatgtgggcagcgaggacgaggggctctatgcggtcagtgctgttaacac^ 

Stggccaggcc?actgctcagcgcag 
Scc2cagScgaagctggagaagatgccatccattgccgaggagcca 

SggSggctg^cattcccgacttcctgcggccactg^ 

gSSggagg?S?gctagagtgccaggtgacgggcctgcgctacc 

ttccacaatggccaccgcatccagagcagcgacgaccggcgcatgacacagtacagggat 

ctcca^St^tgttcggtgccgtggggcctcagcacgccggtgtctacaagagg 

aSgccaacLgctgggcaaagctgcctgctatgcccacctgtatgtcacagatgtg^ 

cSgg?otc^agatggcgccccgcaggtggtggctgtgacggggaggatggtcacactc 

aca?ggaacccccccaggagtctggacatggccatcgacccggactccctgacgtacaca 

gtccagcaccaggtgctgggctcggaccagtggacggcactggtcacaggcctgcgggag 

ccagggtgggcagccacagggctgcgtaagggggtccagcacatcttccgggtcctcagc 

ACcSScA^AGCAGCAGCAAGCCCTCACCCCC^ 

C acggc?caaccctggaggaggcccctgccatgctggacaaaccagacatcgtgtatgtg 

GTGGAGGGA^AGCCTGCCAGCGTCACCGTCACATTCAACCATGTGGAGGCCCAGGTCGTC 

?ggaggag^ 



t>°l IP) 
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FIGURE 2GG 

-^GGTGCTGCTGACCGAGAGCAGCCATGT^ 



CTGGCGGGTGAGGTCTCCTGCAAAGCAGAGT^ 

CCTOTCGGTGTTGTTGCGTTGCTCTGTGT^ 

GAGTTCTCTGGCTCCCGGGTGTCCCTCACAGAC^ 
A^CCCAGAGACTGGGGCTGCCACCCCCATGGACTG^CAG 

CAGGACCAGGAGGCTCCCAGCCCAGAGGCCCTCCCC1L GCTGAGAGC GCCCTGCCC 

GGGGCTAGCCCCAGGCGGGGAGAGCTCCGCAGGGG^ 

CGGGCCGGGCCGCGGGAGCTGGGCCGGGGCCTGCACA^ 

CCCrcGATGGCACGAGCTGCCTCGAGCGAGGCAGCGCCG^^ 



PCTAJS00/14842 

WO 00/73469 




Ia^ggaatcgagggacaggaagggggagg^ 

G^CTCAGGGAGAACCAAGGAAGGTGGGCATGGCTC 

tSStgtcagggcagtaggctgggagtc^ 
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FIGURE 211 

GAGAGCTGGAGGCCAGCAGTCACTCACACTCGCTCTGTCCTCCTGTC^GTGGATACAGC 
CCTGGGCGCTCTGCTGGCCCAAGGATGTCCCCACTGCCCCTCCATGGCCTTTGGCCTTCT 
TCCCATTCATATTTATTTATTTATTGACTTTTATGAAGTTTCCCCTTCCATCCGATCCCT 

Ictgcccatgttgtcctgaccatccctcccagccatccagct^^ 
aggaaataaaaatggcaagcagcaaaaaaa 

cpn TD NO- 43 AA542015 M SGK088_M 

GCCACGGACATCTGGGGAGCGGGTGTGCTCACCTACATCATGCTTAGTGGGTACTCCCCA 
TTCTATGAGCCAGACCCCCAGGAAACAGAGGCTCGGATTGTTGGGGGTCGCTTTGATGCC 
TTCCAGTTGTATCCTAACACATCCCAGAGTGCCACCCTCTTCTTGAGAAAGGTCCTCTCA 
GTACATCCCTGGAGCCGGCCCTCTCTGCAGGACTGCTTGGCCCACCCATGGCTGCAAGAT 
GCCTACCTGATGAAGCTGCGCCGCCAGACACTCACCTTCACCACCAACCGGCTCAAGGAA 

?tcSgggcgagcagcggcgacgtcgggctgaggctgctacccgtcacaaggtgctgctc 

CGCTCCTACCCTGGCAGCCCCTAGGTGGCACAGACCGCAGCCCGGCCACGGGCTTCAACT 
TGGGTTCTCACTCGCGCTGCCAAGGGACATTCCAGAGCCCATGCTGAGCTGGACAGGCAG 
GGGCTTCAGATACCAGCAGCAGCAGCAGCAGCAGCAGCAGCAACATCTGGCTGGGCTATT 

aStStggacctaagaggacaaggccctggggcttcagccgaatgtcacccc 

ACCAGAGCAGGAGACCCACTGGCCAGGCTGGGCAAGGGTGAGAGCAGAAAGAGGCAAAGA 

Sggagttgggaagtgaagaatgagacggaggatagagagggaggagtttgaggaaggttc 

?agSctggagtggaatgctatatctcagggagaagccagaaggggacatggctg^^ 

aagaaggaccctgtgatgtgggaatgtggtggagaggaggactggacatagagagtgtgc 

cagg^gccagagcagagacataagggagggcagaagggtagaaggcaacaggagtggg^ 

caggggtggcagggcaggccagcagctgcatcttcagaaagagagaggagaaaggcaaag 

aga?gaSgSccgctccagctggtctcctgtcccagccgatgcagttctgggcg 

aSggc^agggatgtcctcactgctcctccatggcctttgccctccttcccatttgtat 

™ttatttattgccttttgtggagtttcctttctatccagtccctagtgcctatgttg 

tcccgaccatcccccttcagtcacccagctgtctgtgcagctgtctgtctgtctgtcaca 

aggaaaataaaacaaaacaaaacaaaacaaaacaaacaaaaaaaaaacagc 

QFD TT5 NO- 44 R19772 H 

atgaagggcggcgacagggcttacacccgaggtccctctttggggtggctctttgctaag 
tcc^ctgttgcttcccgtgtagagatgcatactctcattcctcaagcgagaatggaggc 
aIg?ccgactccgtagccaacctgcaggcccagccctccctgaacttcatc 

c^ggScc?Sgcgctccaccaacact^ 

S?Scagcgggaaggcagatggaaacatcaaaaagcag^ 

Sga^c??tgLctgggatctcccaagcctggggatgaaacaacccctcagggagacagc 

Stgatgagagcaagaaaggttggggtgaagatgagccggatgaagagtcacacacaccc 
cSccISacctatgaagatttttgac^ 

Sgctagcagcccggcaggcttccactgaagtacctactgctgcagaccttgtcaatgca 
Sa^aaSggtca^ 

aSga^tgcaggctgcctgaatgaggggatggccccacccacacctcctaaaaatcca 
g^gaagaacagaaagccaaggccctgagaggcaggatgtttgtcctgaatgagctggta 
cagac^gagaaagactatgtcaaggatctgggcattgtggtggagggcttcatgaagaga 
SaSSga^gggtgtccctgaggatatgcgaggaaaggacaaaatcgtgtttgga^t 
aScatca^ttatgactggcataaggattttttcctggcggaactggaaaagtgtatc 
cIggagcaagacagattggcacagctctttattaagcacgagcggaagctgcacatc 

?SgaggaLtaaaacaggagataaatcagaggctgacactgagtgacttcctcatcaa 
Scat?cagLaataacaaaataccagt TC 

Sggctggtttggagtgttcagatatcgagaaagcagtggagttaatgtgccttgttccc 
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FIGURE 2JJ 

AAACGCTGCAATGACATGATGAATCTAGGACGTCTGCAGGGCTTTGAGGGCACTCTGACT 

GCTCAGGGGAAGCTACTGCAGCAGGACACATTCTATGTGATCGAGCTGGATGCAGGCATG 

CAGTCCCGGACCAAAGAGAGGCGCGTGTTCCTCTTCGAGCAGATTGTCATCTTCAGTGAA 

CTGCTCAGGAAGGGATCCCTCACCCCTGGCTACATGTTCAAAAGGAGCATCAAGATGAAT 

TACTTGGTCCTGGAGGAGAATGTGGACAATGATCCCTGCAAGTTTGCACTCATGAACAGA 

GAGACTTCTGAGAGGGTTGTTCTGCAAGCCGCCAACGCTGACATCCAGCAGGCCTGGGTG 

CAGGACATCAATCAAGTCTTAGAAACACAGCGAGACTTTTTGAATGCACTGCAATCGCCC 

ATTGAGTATCAACGGAAAGAAAGGAGCACAGCTGTGATGAGGTCTCAACCTGCCAGGCTT 

CCCCAAGCCAGCCCCAGGCCCTACTCCTCTGTTCCTGCGGGCTCAGAGAAGCCCCCAAAG 

GGCTCCAGCTATAACCCACCTCTGCCTCCCCTGAAGATATCTACCTCCAATGGCAGTCCA 

GGGTTTGAATACCACCAGCCTGGGGACAAGTTCGAAGCCAGCAAGAACGACCTGGGAGGC 

TGCAATGGGACCTCGTCCATGGCCGTGATCAAAGATTACTATGCACTGAAGGAGAATGAA 

ATCTGTGTGAGCCAAGGTGAGGTGGTCCAGGTCCTCGCCGTCAACCAGCAGAACATGTGT 

CTGGTGTACCAGCCTGCCAGCGACCATTCCCCCGCCGCCGAGGGCTGGGTCCCAGGCAGC 

ATCCTGGCGCCCCTCACCAAAGCCACAGCAGCAGAAAGTAGTGACGGGAGCATCAAGAAG 

TCATGTTCATGGCATACTCTACGCATGAGAAAGCGGGCGGAAGTGGAGAACACGGGTAAA 

AATGAAGCCACAGGGCCTCGTAAACCCAAGGATATTCTGGGCAACAAAGTCTCTGTTAAA 

GAGACGAACAGTTCCGAGGAATCAGAGTGTGATGATCTTGACCCTAATACTAGCATGGAG 

ATCTTAAATCCAAATTTCATCCAAGAAGTGGCCCCAGAATTCCTTGTGCCCTTGGTGGAT 

GTGACCTGCTTGCTTGGGGACACAGTGATACTGCAGTGCAAAGTCTGTGGGCGGCCAAAG 

CCCACCATCACTTGGAAGGGTCCAGACCAGAACATCCTTGACACTGATAACAGCTCAGCC 

ACATACACGGTCTCCTCTTGTGATTCTGGAGAAATCACCCTGAAGATCTGTAATCTGATG 

CCCCAAGACAGTGGGATTTATACCTGCATAGCAACAAATGACCACGGGACCACATCAACG 

TCTGCAACAGTCAAAGTGCAAGGTGTTCCAGCAGCCCCTAACCGCCCCATTGCCCAGGAG 

AGAAGCTGCACCTCCGTGATTCTCCGCTGGCTGCCCCCCTCCAGCACAGGAAACTGCACT 

ATTTCTGGTTACACTGTGGAGTACAGAGAGGAAGGTTCTCAGATCTGGCAGCAGTCAGTG 

GCTTCGACCTTGGACACTTACCTCGTCATCGAAGACCTTAGTCCCGGGTGTCCTTATCAG 

TTCAGAGTCAGTGCCAGTAACCCCTGGGGAATCAGCCTTCCCAGCGAGCCCTCGGAGTTT 

GTGCGACTTCCAGAATACGATGCTGCTGCTGATGGTGCCACCATTTCTTGGAAGGAAAAT 

TTTGACTCAGCTTACACTGAGCTGAATGAAATTGGAAGAGGCCGTTTCTCTATAGTAAAG 

AAATGCATTCACAAAGCTACCCGCAAAGATGTGGCTGTGAAATTTGTTAACAAAAAAATG 

AAGAAGAAAGAACAGGCTGCCCACGAGGCTGCCCTGCTTCAGCACCTACAGCACCCCCAG 

TACATCACTCTCCATGACACCTATGAGTCCCCCACATCCTACATCCTGATCTTGGAACTG 

ATGGATGATGGCCGGCTCTTAGACTACCTTATGAATCATGATGAACTGATGGAGGAAAAA 

GTAGCTTTCTATATCCGAGACATCATGGAGGCTCTGCAGTACCTTCACAACTGCAGGGTT 

GCACATTTGGACATAAAGCCTGAAAACCTGCTCATTGACCTACGGATTCCAGTGCCTCGA 

GTGAAGCTCATTGACTTGGAGGATGCTGTCCAGATCTCGGGTCACTTCCACATTCACCAC 

CTGCTGGGGAACCCTGAGTTTGCTGCCCCAGAAGTCATTCAAGGCATCCCCGTCTCCCTG 

GGGACAGACATCTGGAGCATCGGGGTTCTGACATATGTCATGGTGAGTGGGGTCTCCCCC 

TTCTTGGATGAGAGCAAAGAGGAGACATGTATCAACGTATGCAGGGTGGATTTCAGCTTC 

rCCCATGAATACTTCTGTGGTGTGAGCAATGCTGCCAGAGATTTCATCAATGTGATCTTA 

CAGGAAGATTTTCGGAGGCGGCCCACAGCAGCCACATGCTTGCAGCATCCATGGCTGCAG 

CCCCATAATGGCAGCTACTCTAAGATCCCCCTGGACACCTCCCGCCTAGCATGCTTCATA 

GAACGTCGCAAGCACCAGAATGATGTGCGGCCTATCCCCAATGTCAAGAGCTACATTGTC 

AACCGGGTGAACCAAGGGACGTAG 
SEO ID NO: 45 5R72 8 2_H 

rGCCGCTGTTTGTCCTCGCGCGGCCCCGTCCACTGCCCTGCGGTTGCTCTGCGGGCTGAA 
AAGTTTCTCCCGGTGCAGAATTCCGGGCTCAGCGACAGCCTGCGCCGAGTGTGCGCACCT 
GTCGGAGACCCGCCAGTCCGCCGGCCCCGGCTTTGTTCGTGCGGAACTGTAGTGGTGAGA 
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FIGURE 2KK 

AAAACTCCATGTCTGGGCACGCCTGGCTGATCTTCACCTCTTTCTTCTAGGACCTTCCTC 

TGGGCTGTCACGTGTGAATATGTGTCTAGTGCATCCTTAACCTGAGGACTTCACCAGTTC 

GAAATTACAGTTTTCACCATCAACTACCTTATCCTTTTTGGCCTGGTTTTCTTCCTCAAA 

CAGTGGAAACATTTTTAAAGTTGCTTTTGTTGCAGAGTTAAACAAATGGCTGATAGTGGC 

TTAGATAAAAAATCCACAAAATGCCCCGACTGTTCATCTGCTTCTCAGAAAGATGTACTT 

TGTGTATGTTCCAGCAAAACAAGGGTTCCTCCAGTTTTGGTGGTGGAAATGTCACAGACA 

TCAAGCATTGGTAGTGCAGAATCTTTAATTTCACTGGAGAGAAAAAAAGAAAAAAATATC 

AACAGAGATATAACCTCCAGGAAAGATTTGCCCTCAAGAACCTCAAATGTAGAGAGAAAA 

GCATCTCAGCAACAATGGGGTCGGGGCAACTTTACAGAAGGAAAAGTTCCTCACATAAGG 

ATTGAGAATGGAGCTGCTATTGAGGAAATCTATACCTTTGGAAGAATATTGGGAAAAGGG 

AGCTTTGGAATAGTCATTGAAGCGACAGACAAGGAAACAGAAACGAAGTGGGCAATTAAA 

AAAGTGAACAAAGAAAAGGCTGGAAGCTCTGCTGTGAAGTTACTTGAACGAGAGGTGAAC 

ATTCTGAAAAGTGTAAAACATGAACACATCATACATCTGGAACAAGTATTTGAAACGCCA 

AAGAAAATGTACCTTGTGATGGAGCTTTGTGAGGATGGAGAACTCAAAGAAATTCTGGAT 

AGGAAAGGGCATTTCTCAGAGAATGAGACAAGGTGGATCATTCAAAGTCTCGCATCAGCT 

ATAGCATATCTTCACAATAATGATATTGTACATAGAGATCTGAAACTGGAAAATATAATG 

GTTAAAAGCAGTCTTATTGATGATAACAATGAAATAAACTTAAACATAAAGGTGACTGAT 

TTTGGCTTAGCGGTGAAGAAGCAAAGTAGGAGTGAAGCCATGCTGCAGGCCACATGTGGG 

ACTCCTATCTATATGGCCCCTGAAGTTATCAGTGCCCACGACTATAGCCAGCAGTGTGAC 

ATTTGGAGCATAGGAGTCGTAATGTACATGTTATTACGTGGAGAACCACCCTTTTTGGCA 

AGCTCAGAAGCGAAGCTTTTTGAGTTAATAAGAAAAGGAGAACTACATTTTGAAAATGCA 

GTCTGGAATTCCATAAGTGACTGTGCTAAAAGTGTTTTGAAACAACTTATGAAAGTAGAT 

CCTGCTCACAGAATCACAGCTAAGGAACTACTAGATAACCAGTGGTTAACAGGCAATAAA 

CTTTCTTCGGTGAGACCAACCAATGTATTAGAGATGATGAAGGAATGGAAAAATAACCCA 

GAAAGTGTTGAGGAAAACACAACAGAAGAGAAGAATAAGCCGTCCACTGAAGAAAAGTTG 

AAAAGTTACCAACCCTGGGGAAATGTCCCTGAGACCAATTACACTTCAGATGAAGAGGAG 

GAAAAACAGTCTACTGCTTATGAAAAGCAATTTCCTGCAACCAGTAAGGACAACTTTGAT 

ATGTGCAGTTCAAGTTTCACATCTAGCAAACTCCTTCCAGCTGAAATCAAGGGAGAAATG 

GAGAAAACCCCTGTGACTCCAAGCCAAGGAACAGCAACCAAGTACCCTGCTAAATCCGGC 

GCCCTGTCCAGAACCAAAAAGAAACTCTAAGGTTCCCTCCAGTGTTGGACAGTACAAAAA 

CAAAGCTGCTCTTGTTAGCACTTTGATGAGGGGGTAGGAGGGGAAGAAGACAGCCCTATG 

CTGAGCTTGTAGCCTTTTAGCTCCACAGAGCCCCGCCATGTGTTTGCACCAGCTTAAAAT 

TGAAGCTGCTTATCTCCAAAGCAGCATAAGCTGCACATGGCATTAAAGGACAGCCACCAG 

TAGGCTTGGCAGTGGGCTGCAGTGGAAATCAACTCAAGATGTACACGAAGGTTTTTTAGG 

GGGGCAGATACCTTCAATTTAAGGCTGTGGGCACACTTGCTCATTTTTACTTCAAATTCT 

TATGTTTAGGCACAGCTATTTATAGGGGAAAACAAGAGGCCAAATATAGTAATGGAGGTG 

CCAAATAATTATGTGCACTTTGCACTAGAAGACTTTGTTAGAAAATTACTAATAAACTTG 

CCATACGTATTACAGCAGAAGTGCTTCAGTCATTCACATGTGTTCGTGAGATTTTAGGTT 

GCTATAGATTGTTTAAGACAGCTTATTTTAAATGTAGAAAAATAGGAGATTTTGTAACTG 

CTTGCCATTAACTTGCTGCTAAATTCCCAATGTATTGATTAAATCAATAAAAAACAGATG 

TTACTC 

SEQ ID NO: 4 6_SGK309_H 

GGGTCCGCAGCCCGCCCTCACAGGCCCTCCTCACTCCCCTAGGTAGATGGCCCCCTCAGG 
GCAGGCCCGGCGGACACCCCTCCCTCTGGCTGGCGGATGCAGTGCCTAGCGGCCGCCCTT 
AAGGACGAAACCAACATGAGTGGGGGAGGGGAGCAGGCCGACATCCTGCCGGCCAACTAC 
GTGGTCAAGGATCGCTGGAAGGTGCTGAAAAAGATCGGGGGCGGGGGCTTTGGTGAGATC 
TACGAGGCCATGGACCTGCTGACCAGGGAGAATGTGGCCCTCAAGGTGGAGTCAGCCCAG 
CAGCCCAAGCAGGTCCTCAAGATGGAGGTGGCCGTGCTCAAGAAGTTGCAAGGTTCGGGC 
CTCGGGCAGGGGGATGGGAAGGAAGAGATGATGAAGCCAGGGGCTAAGAGAGGGAAGGAC 
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FIGURE 2LL 

CATGTGTGCAGGTTCATTGGCTGTGGCAGGAACGAGAAGTTTAACTATGTAGTGATGCAG 

CTCCAGGGCCGGAACCTGGCCGACCTGCGCCGTAGCCAGCCGCGAGGCACCTTCACGCTG 

AGCACCACATTGCGGCTGGGCAAGCAGATCTTGGAGTCCATCGAGGCCATCCACTCTGTG 

GGCTTCCTGCACCGTGACATCAAGCCTTCAAACTTTGCCATGGGCAGGCTGCCCTCCACC 

TACAGGAAGTGCTATATGCTGGACTTCGGGCTGGCCCGGCAGTACACCAACACCACGGGG 

GATGTGCGGCCCCCTCGGAATGTGGCCGGGTTTCGAGGAACGGTTCGCTATGCCTCAGTC 

AATGCCCACAAGAACCGGGAGATGGGCCGCCACGACGACCTGTGGTCCCTCTTCTACATG 

CTGGTGGAGTTTGCAGTGGGCCAGCTGCCCTGGAGGAAGATCAAGGACAAGGAACAGGTA 

GGGATGATCAAGGAGAAGTATGAGCACCGGATGCTGCTGAAGCACATGCCGTCAGAGTTC 

CACCTCTTCCTGGACCACATTGCCAGCCTCGACTACTTCACCAAGCCCGACTACCAGTTG 

ATCATGTCAGTGTTTGAGAACAGCATGAAGGAGAGGGGCATTGCCGAGAATGAGGCCTTT 

GACTGGGAGAAGGCAGGCACCGATGCCCTCCTGTCCACGAGCACCTCTACCCCGCCCCCA 

GCAGAACACCCGGCAGACGGCAGCCATGTTTGGGGTGGTCAATGTGACGCCAGTGCCTGG 

ggacc^Stccgggagaacaccgcggatgto 

GAATGCACCCCCAATTCTGCCCGGGAGGCCCTCTGAGGGGCTGGGCCACAGTCCCCACCT 
TGTCCCCCACCCCGGGGGTCCTGAGGCTGAAGTCTGGGAGGAGACAGATGTCAACCGGAA 
CAAACTCCGGATCAACATCGGCAAAGTAACTGCCGCCAGGGCGAAGGGCGTGGGTGGCCT 
™?CTCACCCCCGATTCCCAGCCTTGTGCCCCTGCCCTGTTCCTCCTAAGCACCCTGT 
CCCCCGCCAATCTCCCTGCTTGCCCGGCCTCTGTTTCCGGTCCCCTCCCCGGCACTAGCC 
TCGCTGTGTCTTCCATCATCATCATCCTCTGTCTCCTTCACACTGAGGAGACCATCCGCC 

GGCGCGCCAGATATC^CACGTC 

GAGCAGGTGCTGGCACAAGAGCAGCGGCTTGGGGGAGCCGGCAGCAGCAGTAACAGCAGC 

agSg?cgccgccgccgccgccagtaaacgcggaccgtaccccaggggactaccc^^ 

GCCGGCCCTGGAAGCCGCGCTCGGGTCCCGCCGCAGTCGGCGGTGGGGGATGGGCAGGCA 

SgcgScccgcctgccgagggttaacccc 

GCCCTCCTCCAGAAACCCCTGCGTCCGCCACGGCCCAGGTTAAATGGAAACCACCCTTGG 

gSSggatgcctgtgtagctgttctaccatatcagtgtattgcaatgagtgggggag^ 

G^CAGCTGGATATCCTGAGTGTTGGAATCCTAGTGAAAGAAAGATGGAAAGTGTTGAGA 
AAGATTGGGGGTGGGGGCTTTGGAGAAATTTACGATGCCTTGGACATGCTCACCAGGGAA 
AtSSGScTGAAGGTGGAATCAGCTCAACAACCAAAACAAG 

g^SSttgaaaaagctgcaagggaaagaccatgtttgtagatttattggctgtggga^ 

^TGATCGATTCAACTATGTGGTCATGCAGTTGCAGGGTCGGAATCTGGCAGATCTTCGC 

^ag^agtcccgaggcacattcaccattagtaccactctccggctgggtagac^ 
Sggagtctattgaaagcattcattctgtgggatcttgncatcgagacatcaa^ 
II??t?gctatgggtcgctttcctagtacatgtaggaaatgttacatgcttgattttgg^ 
^ggctcgacaatttaccaattcctgtggtgacgtcagaccacctcgagctgtggcaggt 

S?cgISgLagttcgttatgcatcaatcaacgcacatcggaac^ 
cSgSgacctttggtccttattctacatgttggtggagtttgtggt^^ 

CTCATG^TGAAACATCTCCCTCCAGAATTCAGCATCTTTCTAGACCATATCTCTTCTTTG 
GATTATTTTACAAAACCAGACTACCAGCTTCTTACATCCGTGTTTGACAATAGCATC^ 
ACTITTGGAGTAATTGAGAGTGACCCTTTTGACTGGGAGAAGACTGGAAATGATGGCTCC 

ctISIaccaccactacttctaccacccctcagttgcacactcgcttgacccctgctgca 
StcgaIttgccaatgctactcccatccctggagacttgcttcgagaaaatacagatc 
SStt^cagatgaacagcttagcgatggagaaaatggcatccctgttggtgtgtcacc^ 
gSaaISgcctggatctctgggacacccccgtccccaggagaaggatgtttgggaagag 

atcgatgccaacaaaaacaagataaagcttggaatttgt^^^ 
ScagSatggccaggcaaatggtcttctcaatgctccaagcct^ 
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GTCCGCTCAGAGATTACTCAGCCAGACAGAG^ 
ATTCACAGCTTTGAGCTGGAAAAA^ 

TTCCTTGAGACCTGGTATAAAATAGTGTATTTTTCTTll aaaqTCTTTCAAA 

Ittattgttgtcattgttg^ 

TAAGAAATCCTTGCATTTTTGTAACACTG^ 

TTTAATTATTATTATCCTGATTCTTAAT^ 

TGGAAGCTGAGCAGTAAGAGCTGATGACTTCCTAA^ 

TTTTACAGTAACTTTGTTTAGA^GTAATCTCTTC^ 

GGCATGATAAAGATGGCATTAGGCAGAG^^ 

SEQ ID NO: 48 - AA4 ^^f-" rTGTTTTTG ATTTAGAGGCGAAATCAGCACGTGGT 
ACTTTTACTATATTCTTTGAGATGACTGTTTTTG^ 

GGCTCAAATCTCCTTATGGATAGTG^ 
CGGGGCCTGGCGTACATCCACCACCAACACGTTCTTC^^ 

TTACTCATCAGTCACCTGGGAGAGCTCAAAC^ 

TCCATTCCCAGCCAGACATACTCTTCAGAAGTC^ 

GCTTTGCTGGGAGCCACTGAATATTCCTCTGAG^ 

TTTATTGAAATGTTCCAGGCTCAACCTTTGTTTCCTGG 

CTGGAGAAAATCTGGGAGGTGCTGGGAGTCCCTACA^ 

AAGCTACCTAACTACAATCCAGAATGGT^CCAC 

GTCTGGAACAGGCTGGGCAGGGTTCCTGAAGCI TTCATGATT ATTTCAGCGCC 
GGCTTTCCCAGAGACCGCGTCTCCGCCCAGGAAGCACli^ TACAGTTTC AGGAGTG 

Sgccatctcagctgtaccag^ 

AGGCTAAAGCCAGAAATGTGTGACCTTTTGGCCTCC^ 

CAGTTTAGCAAATGCTGGTGAAAAGAAAGGGCGAGAT^ 

ATTTCTGCAGTTTCGGTTTTCATTTGCTTCAGCTTACTAA^ 

atLtgaacaaggggctttatgtcctc™ 

tttctgtcatatacataactgtattcacagagatataaaa GATTCC aggaga 
taagcaaggaataatcaagatttgtg^acttcgg^gttt^ 

TGCCTACACCGATTATGTAGCTACGAGATGGTACCGA^ 
TACTCAGTATGGTTCTTCAGTCGATATATGGGCTATTG^ 

GACAGGCCAGCCACTGTGGCCTG^^ 
AACACTAGGAAAATTAATCCCAAGACATCAA^ 

tggcatcagtatacctgagccagaagaca^^ 

TCATCCTGTGGCTCTGAACTTCATGAAGGGGTGTCT^ 
AACCTGTTCCCAACTCCTGGAGAGCTCCTACTTTGAT^ 

AAGAAAAGCACGTAATGAAGGAAGA^^^ 
CATACCAGGAAGCCACATCTCCCCCACACCTGATG^ 
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FIGURE 2NN 

TGCTTTTACAT 

SEQ ID NO: 5 0 AAO 617 9 7_M » GTTGAAACACCCAAACCTCGTGAACCT 

SSS^SSS 5SSSE3S 

TATCTTTAGGAGTAACCAGTTTTTCCGCGGCAT^ 

TCAGTCTTTTCAAGAGGATC^TGAAM^=CCCG CTCCCCCACAC CTGA ■ 

CA^CCACAGTAACGCCCCTGGATCC^ 
CCACCGGCCATTTCTCTAAATATCATTCTG^ 
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FIGURE 200 

TOCAG^GTCATCGGGAGGCG^GC^ 

AAGCRCRAGCGGCATAAGTC^CACTCCRAMACAT GATGMAT CAGCTCTGRTTCC 
<X»TCCCTGGGCACM^ 

1 



GACTTA 1 



GGATCGATGAAGGACCGGATAT^^-— C AGGTCATCCAAGCTCCAC 
TATGGGAAGGCGCAGGTAGCCAAAA^ 
AAGGAGAAGACCAGGAAAGAACGGGAGCTG^ 

CATCGAAAAAGGGAAACACCCAAAAGTT^ 
AGGAGCCCCCACAGGAAGTGGTCTGACAGCTC^^ 
TCTTATGGCCAAGATTATGACCTTAGTCOT^ 
TCCTACAAGAAAAGTCCTGGAAGTACCTCGAGAAGG^ 

gcISttcaagacattcatctt = 

CATTCCAGTATCTCACCTGTCAGGCTTCCACT^ 

AGGAAAAAGAAGGAAAGAGCAGCTGCTGCTGCTGC^ 

AAGGGTTCACCTGTATTTTTGCCTAGAAAAGAGAACA^ 

GGTTTGGAGTCTAAAAAGTTACCCAG^^ 

GAACTGGTGAATGTAACACATCT^^ 
GTAAAGTTGGATGAGAACTCCGAGAAGCATCT^^ 

AGAGACTCTAAACCCATAGCACTG^ 
TCAGAAAAGGAGACCCCTCCACCTCTTCCC^ 
ACTACTACCCCTCCACCTCAGACACCCCCTTTG^^^ 
CCACAGCAACCACCTCTGCCTCCTTCTCAG^AGCA^ 
ACTTCAACTTTGCCCCCTTCTACTCACTCAAAG 
TCTCAGCCCCCTGTACAGGTTTCTGTG^ 
CACCTGAAAACTTCAACGTTGCCTCCT^^ 
GACATGGATAGTCCAAAAGAAACTCTTCC^ 



AGC^CACGTCACTTACTCACAGA^ 
TCTCXSCCAGACTC^ 

AGACCAAAAATTTGTTGTCCTCGTTATGG^ 

AAACGCTGTGTGGACAAGTTTGACATTAT^ 

GTATATAAAGCCAGGGACAAAGACACAGG^ 
GACAATGAGAAAGAGGGCTTCCCAATCACAGC^ATT ^ 

TTAATCCACCGAAGTGTTGTTAACATGAAGG^ 
GATTTCAAGAAGGACAAAGGTGCCTTTTACCTTGTAT^ 

ATGGGACTGCTAGAATCTGGTTTGGTGCACTT 

AAACAGCTAATGGAAGGATTGGAAraC^ 



^TCGGCTCTATAACTCTGAAGAGAGTCGC^ 
TACCGACCTCCAGAACTACTGCTAGGAGAGG^ 
AGCTGTGGATGTATTCTTGGGGAACTATT^ 
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FIGURE 2PP 



CGTCTACGAGAAGAATTCTCTTTCATTCCTTCTGCAGCACTTGATTTATTGGACCACATG 
CTGACACTAGATCCTAGTAAGCGGTGCACAGCTGAACAGACCCTACAGAGCGACTTCCTT 
AAAGATGTCGAACTCAGCAAAATGGCTCCTCCAGACCTCCCCCACTGGCAGGATTGCCAT 
GAGTTGTGGAGTAAGAAACGGCGACGTCAGCGACAAAGTGGTGTTGTAGTCGAAGAGCCA 
CCTCCATCCAAAACTTCTCGAAAAGAAACTACCTCAGGGACAAGTACTGAGCCTGTGAAG 
AACAGCAGCCCAGCACCACCTCAGCCTGCTCCTGGCAAGGTGGAGTCTGGGGCTGGGGAT 
GCAATAGGCCTTGCTGACATCACACAACAGCTGAATCAAAGTGAATTGGCAGTGTTATTA 
AACCTGCTGCAGAGCCAAACCGACCTGAGCATCCCTCAAATGGCACAGCTGCTTAACATC 
CACTCCAACCCAGAGATGCAGCAGCAGCTGGAAGCCCTGAACCAATCCATCAGTGCCCTG 
ACGGAAGCTACTTCCCAGCAGCAGGACTCAGAGACCATGGCCCCAGAGGAGTCTTTGAAG 
GAAGCACCCTCTGCCCCAGTGATCCTGCCTTCAGCAGAACAGATGACCCTTGAAGCTTCA 
AGCACACCAGCTGACATGCAGAATATATTGGCAGTTCTCTTGAGTCAGCTGATGAAAACC 
CAAGAGCCAGCAGGCAGTCTGGAGGAAAACAACAGTGACAAGAACAGTGGGCCACAGGGG 
CCCCGAAGAACTCCCACAATGCCACAGGAGGAGGCAGCAGCATGTCCTCCTCACATTCTT 
CCACCAGAGAAGAGGCCCCCTGAGCCCCCCGGACCTCCACCGCCGCCACCTCCACCCCCT 
CTGGTTGAAGGCGATCTTTCCAGCGCCCCCCAGGAGTTGAACCCAGCCGTGACAGCCGCC 
TTGCTGCAACTTTTATCCCAGCCTGAAGCAGAGCCTCCTGGCCACCTGCCACATGAGCAC 
CAGGCCTTGAGACCAATGGAGTACTCCACCCGACCCCGTCCAAACAGGACTTATGGAAAC 
ACTGATGGGCCTGAAACAGGGTTCAGTGCCATTGACACTGATGAACGAAACTCTGGTCCA 
GCCTTGACAGAATCCTTGGTCCAGACCCTGGTGAAGAACAGGACCTTCTCAGGCTCTCTG 
AGCCACCTTGGGGAGTCCAGCAGTTACCAGGGCACAGGGTCAGTGCAGTTTCCAGGGGAC 
CAGGACCTCCGTTTTGCCAGGGTCCCCTTAGCGTTACACCCGGTGGTCGGGCAACCATTC 
CTGAAGGCTGAGGGAAGCAGCAATTCTGTGGTACATGCAGAGACCAAATTGCAAAACTAT 
GGGGAGCTGGGGCCAGGAACCACTGGGGCCAGCAGCTCAGGAGCAGGCCTTCACTGGGGG 
GGCCCAACTCAGTCTTCTGCTTATGGAAAACTCTATCGGGGGCCTACAAGAGTCCCACCA 
AGAGGGGGAAGAGGGAGAGGAGTTCCTTACTAA 

SEQ ID NO: 52_AA789239_H 

TGAAAATGGAGATGTATGAAACCCTTGGAAAAGTGGGAGAGGGAAGTTACGGAACAGTCA 
TGAAATGTAAACATAAGAATACTGGGCAGATAGTGGCCATTAAGATATTTTATGAGAGAC 
CAGAACAATCTGTCAACAAAATTGCGATGAGAGAAATT^AAGTTTCTAAAGCAATTTCATC 
ACGAAAACCTGGTCAATCTGATTGAAGTTTTTAGACAGAAAAAGAAAATTCATTTGGTAT 
TTGAATTTATTGACCACACAGTATTAGATGAGTTACAACATTATTGTCATGGACTAGAGA 
GTAAGCGACTTAGAAAATACCTCTTCCAGATCCTTCGAGCAATTGACTATCTTCACAGTA 
ATAATGTAATCATTCATCGAGATATAAAACCTGAGAATATTTTAGTATCCCAGTCAGGAA 
TTACTAAGCTCTGTGATTTTGGTTTTGCACGAACACTAGCAGCTCCTGGGGACATTTATA 
CGGACTATGTGGCCACACGCTGGTATAGAGCTCCCGAATTAGTATTAAAAGATACTTCTT 
ATGGAAAGTATGTGCCTGTGGATATCTGGGCTTTGGGCTGTATGATCATTGAGATGGCCA 
CTGGAAATCCCTATCTTCCTAGTAGTTCTGATTTGGATTTACTCCATAAAATTGTTTTGA 
AAGTGNGATTCATGCCAGAACTGAAAGCTAAATTACTGCAGGAAGCAAAAGTCAATTCAT 
TAATAAAGCCAAAAGAGAGTTCTAAAGAAAATGAACTCAGGAAAGATGAAAGAAAAACAG 
TTTATACCAATACACTGCTAAGTAGTTCAGTTTTGGGAAAGGAAATAGAAAAAGAGAAAA 
AGCCCAAGGAGATCAAAGTCAGAGTTATTAAAGTCAAAGGAGGAAGAGGAGATATCTCAG 
AACCAAAAAAGAAAGAGTATGAAGGTGGACTTGGTCAACAGGATGCAAATGAAAATGTTC 
ATCCTATGTCTCCAGATACAAAACTTGTAACCATTGAACCACCAAACCCTATCAATCCCA 
GCACTAACTGTAATGGCTTGAAAGAAAATCCACATTGCGGAGGTTCTGTGACAATGCCAC 
CCATCAATCTAACTT^ACAGTAATTTGATGGCTGCAAATCTCAGTTCAAATCTCTTTCACC 
CCAGTGTGAGGTTAACTGAAAGAGCAAAAAAGAGACGCACTTCTTCACAATCTATTGGAC 
AAGTTATGCCTAATAGCAGGCAAGAGGATCCAGGTCCTATTCT^AAGCCAAATGGAGAAGG 
GTATATTTAATGAGCGAACAGGTCACAGTGACCAAATGGCAAATGAGAACAAAAGGAAGC 
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CTCAGCTAACTATCA^GGT^ 

GTTTTTTTTCTGGTAGTGTCmTCTTTTACAi GTTGTT AAGACAGCCATCC 
TTCCCGTTTTATTTATCTACAATAGAAG^ 

CTTGAA 

CTGGCAGATATAGTTCATGCTTGTTTACW 
TTTAAAGAAAATGAACCTGTGAGAGATGAGA^ 

GGTGTCAAAGAAGACCCACACGCTGGGG^ 

AGCAGTAATTTGTTGGCCGCAAATCTCAGTTCAAA^ 

ACTGAAAGAACAAAAAAGAGACGCACTTCTTCA^ 

GTCTTTTAAGTAATCTTAAAAATACAAGCTTGAC^ 

Sttttgatoctgtgatttgaaatta^gtca^ 
gttaacat 
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cgtattgcagcctcccaggcccttctgcatcagtac^^ 

CTT^GTGCTCATTTGGGAGCGAGGGTCCMWG^gCCCA 

gaacaac^aS^ 

CGGCATTCCAAAAGAACTCA^CTG^^TM^^ 

AGTAAATCTTCAGTCCGCAGCAGGAGAAG^ 

CATGGTCATGTTTGTATTGTGTTTGA^CTACTGGG^ 
MTT TGTTTGTGAAGTCTGACTATGTAGTCAMTATAA 

ATATTAGGACCC^TACCACAACACATGATTCAG^ 
GTTCGAAGAATGTTAGAATATGA^ 

TCTTTTTGAAATTACCATTTTTAAAWCCTTOAAATA GRTTTTTTTTGAGT AAA 
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ACTTAA.CTTTAAAAGTTTTTCTTCTGTAATTGTTGAACGGGTGATTATTATTAACTCTAG 
ATAAGCAGGTACTAGAAACCAAAACTCAGAAAATGTTTACTGTTAGAATTCTATTAAATT 
TTAAGTGTTGTATTCTTTTTCATTGGGTGATGTCAGGGTGATAACCAGACATTCATGGAA 
AGGCATGCAGTTTGTCCATTGTGACAGTTTGTTTAATAAAACCACATACACACTTTATTT 
AAGATTAAAATCTAACTGGAAAGTCAGCTTGGAAAATGGACATTTCCAAGTATGTTTGGT 
GAGTCACAGATATAAAAATAGAAATTCTGATGAGAGGTTTCAGTTTTTAATACCAAGTCC 
TTAGGAGTCTTAACATTGGCCAGCATCTGTTTATCAAATGACATAAATACGTAAACCTAT 
AAGAATTAAGTTTATTAATTAGGCAATTTATGTCTGTGATAATTCTTACGGGAGAAAGAG 
GATTTGATTGGAAAGCAGTTTGGGAAGAAAGTGCTGCTGAAATTTCCAGAATTTAATTGA 
TTGGTTACATAAACTTTTTGACTTCAAT 

SEQ ID NO: 5 6_AA5 5 7 5 3 6_H 

AGTAAGGCCCCGCGGGCGTCCTGGCCGCCATGTGCACCGTAGTGGACCCTCGCATTGTCC 

GGAGATACCTACTCAGGCGGCAGCTCGGGCAGGGGAGAACATTCCGGGAAATCACGCTCC 

TCCAGGTGAGTGGCCTGGGCCCTCCAGTCCAATCCCCTTGCCCAGGTACAGATCTCTCCA 

GACAGGAGAGAAACTGGCCTTCTTGGGCCCCAGAGCACAGCCCCTCCTGGCCTTCCAGCC 

GCCTCCGACTCTCTCCCCAGGAGTTTGGGGACCATCCCAACATCATCAGCCTCCTTGACG 

TGATCCGGGCAGAGAACGACAGGGACATTTACCTGGTGTTTGAGTTTATGGACACTGACC 

TGAACGCAGTCATCCGGAAGGGCGGCCTGCTGCAGGACGTCCACGTGCGCTCCATCTTCT 

ACCAGCTCCTGCGGGCCACCCGGTTCCTCCACTCGGGGCACGTTGTGCACCGGGACCAGA 

AGCCGTCCAATGTGCTCCTGGATGCCAACTGCACAGTGAAGCTGTGTGACTTTGGCCTGG 

CCCGCTCCCTGGGCGACCTCCCTGAGGGGCCTGAGGACCAGGCCGTGACAGAGTACGTGG 

CCACACGCTGGTACCGAGCACCGGAGGTGCTGCTCTCTTCGCACCGCTACACCGCTTCCT 

GCCCCAGATACACCCTTGGGGTGGACATGTGGAGTCTGGGCTGTATCCTGGGGGAGATGC 

TGCGGGGGAGACCCCTGTTCCCCGGCACGTCCACCCTCCACCAGCTGGAGCTGATCCTGG 

AGACCATCCCACCGCCATCTGAGGAGXXXAGGCCACGACAGACGCTGGATGCCCTCCTAC 

CGCCAGACACCTCCCCAGAGGCCTTGGACCTCCTTAGGCGACTCCTGGTGTTCGCCCCGG 

ACAAGCGGTTAAGCGCGACCCAGGCACTGCAGCACCCCTACGTGCAGAGGTTCCACTGCC 

CCAGCGACGAGTGGGCACGAGAGGCAGATGTGCGGCCCCGGGCACACGAAGGGGTCCAGC 

TCTCTGTGCCTGAGTACCGCAGCCGCGTCTATCAGATGATCCTGGAGTGTGGAGGCAGCA 

GCGGCACCTCGAGAGAGAAGGGCCCGGAGGGTGTCTCCCCAAGCCAGGCACACCTGCACA 

AACCCAGAGCCGACCCTCAGCTGCCTTCTAGGACACCTGTGCAGGGTCCCAGACCCAGGC 

CCCAGAGCAGCCCAGGCCATGACCCTGCCGAGCACGAGTCCCCCCGTGCAGCCAAGAACG 

TTCCCAGGCAGAACTCCGCTCCCCTGCTCCAAACTGCTCTCCTAGGGAATGGGGAAAGGC 

CCCCTGGGGCGAAGGAAGCGCCCCCCTTGACACTCTCGCTGGTGAAGCCAAGCGGGAGGG 

GAGCTGCGCCCTCCCTGACCTCCCAGGCTGCGGCTCAGGTGGCCAACCAGGCCCTGATCC 

GGGGTGACTGGAACCGGGGCGGTGGGGTGAGGGTGGCCAGCGTACAACAGGTCCCTCCCC 

GGCTTCCTCCGGAGGCCCGGCCCGGCCGGAGGATGTTCAGCACCTCTGCCTTGCAGGGTG 

CCCAGGGGGGTGCCAGGGCTTTGCTTGGAGGCTACTCCCAAGCCTACGGGACTGTCTGCC 

ACTCGGCACTGGGCCACCTGCCCCTGCTGGAGGGGCACCATGTGTGAGCCGCCCTACTCC 

CTTCACCTGGCCCTCTGTTCCTGCCCCAGCNCCTTCCCCAGACCCCTCTCCAGTCTCCTG 

CACCCCTTAGCCCTCCCTGCTTTGCCTGGCCCGTTGAAGTTCCAGGGAGCTTGCCCGGGT 

CTCCTCGGGGGAGCAGATGAGGGCCCTGCCC 

SEQ ID NO: 57_N28606_H, MOK_H 

ATGAAGAACTATAAAGCAATTGGCAAAATAGGAGAGGGAACGTTTTCTGAAGTTATGAAG 
ATGCAAAGCCTGAGAGATGGAAACTACTATGCATGTAAACAAATGAAGCAGCGCTTTGAA 
AGTATTGAGCAAGTCAACAACCTACGAGAGATCCAAGCACTGAGGCGCCTGAATCCGCAC 
CCAAACATTCTTATGTTGCATGAAGTGGTTTTTGACAGAAAATCTGGTTCTCTTGCACTA 
ATATGTGAACTTATGGACATGAATATTTATGAGCTAATACGAGGGAGAAGATACCCATTA 
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TCAGAAAAAAAAATTATGCACTATATGTACCAGTTATGTAAGTCCCTGGATCATATTCAC 
AGAAATGGAATATTTCACAGAGATGTAAAACCAGAAAATATACTAATAAAGCAGGATGTC 
CTGAAATTAGGGGACTTTGGCTCCTGCCGGAGTGTCTATTCCAAGCAGCCGTACACGGAA 
TACATCTCCACCCGCTGGTACCGGGCCCCGGAGTGTCTCCTCACTGATGGGTTCTACACG 
TACAAGATGGACCTGTGGAGCGCCGGCTGTGTGTTCTACGAGATCGCCAGTCTGCAGCCC 
CTCTTTCCTGGAGTAAATGAACTGGACCAAATCTCAAAAATCCACGATGTCATCGGCACA 
CCCGCTCAGAAGATCCTCACCAAGTTCAAACAGTCGAGAGCTATGAATTTTGATTTTCCT 
TTTAAAAAGGGATCAGGAATACCTCTACTAACAACCAATTTGTCCCCACAATGCCTCTCC 
CTCCTGCACGCAATGGTGGCCTATGATCCCGATGAGAGAATCGCCGCCCACCAGGCCCTG 
CAGCACCCCTACTTCCAAGAACAGAGGAAAACAGAGAAGCGGGCTCTGGGCAGCCACAGA 
AAAGCTGGCTTTCCGGAGCACCCTGTGGCACCGGAACCACTCAGTAACAGCTGCCAGATT 
TCCAAGGAGGGCAGAAAGCAGAAACAGTCCCTAAAGCAAGAGGAGGACCGTCCCAAGAGA 
CGAGGACCGGCCTATGTCATGGAACTGCCCAAACTAAAGCTTTCGGGAGTGGTCAGACTG 
TCGTCTTACTCCAGCCCCACGCTGCAGTCCGTGCTTGGATCTGGAACAAATGG7VAGAGTG 
CCGGTGCTGAGACCCTTGAAGTGCATCCCTGCGAGCAAGAAGACAGATCCGCAGAAGGAC 
CTTAAGCCTGCCCCGCAGCAGTGTCGCCTGCCCACCATAGTGCGGAAAGGCGGAAGATAA 

SEQ ID NO: 58_AB023 153_H , ICK_H 

ATGAATAGATACACAACAATCAGGCAGCTCGGGGATGGAACCTACGGTTCCGTCCTGCTG 
GGAAGAAGCATTGAGTCTGGGGAGCTGATCGCTATTAAAAAAATGAAAAGAAAATTTTAT 
TCCTGGGAGGAATGCATGAACCAACGGGAGGTTAAGTCTTTAAAGAAGCTCAACCATGCC 
AATGTAGTCAAATTAAAAGAAGTTATCAGGGAAAATGATCATCTTTATTTTATCTTCGAG 
TACATGAAGGAAAATCTTTACCAGCTCATTAAAGAGAGAAATAAGTTGTTTCCTGAGTCT 
GCTATAAGGAATATCATGTATCAGATATTACAAGGACTCGCATTTATTCACAAACTCGGC 
TTCTTTCATCGAGACTTAAAGCCTGAGAACCTCCTCTGCATGGGACCAGAACTTGTGAAA 
ATTGCAGACTTTGGTTTGGCCCGAGAAATACGATCAAAACCTCCATATACAGATTATGTA 
TCTACCAGATGGTACAGGGCTCCAGAAGTACTCCTGAGGTCTACCAACTACAGCTCCCCC 
ATTGACGTCTGGGCGGTGGGCTGCATCATGGCAGAAGTTTACACCCTCAGGCCACTCTTC 
CCTGGAGCCAGTGAAATTGACACAATATTCAAAATTTGCCAAGTGCTGGGGACACCAAAA 
AAGACTGACTGGCCTGAAGGCTATCAACTTTCAAGTGCAATGAACTTCCGTTGGCCACAG 
TGTGTACCCAATAACTTAAAGACCTTGATTCCCAATGCTAGCAGTGAAGCAGTCCAGCTC 
CTGAGAGACATGCTTCAGTGGGATCCCAAGAAACGACCAACAGCTAGTCAGGCACTTCGA 
TATCCTTACTTCCAAGTTGGACACCCACTAGGCAGCACCACACAAAACCTTCAGGATTCA 
GAAAAACCACAGAAAGGCATCCTGGAAAGGGCAGGCCCACCTCCTTATATTAAGCCAGTC 
CCACCTGCCCAGCCACCAGCCAAGCCACACACACGAATTTCTTCACGACAGCATCAAGCC 
AGCCAGCCCCCTCTGCATCTCACGTACCCCTACAAAGCAGAGGTCTCCAGGACAGATCAC 
CCAAGCCATCTCCAGGAGGACAAGCCAAGCCCGTTGCTTTTCCCATCCCTCCACAACAAG 
CATCCACAGTCGAAAATCACAGCTGGCCTGGAGCACAAAAATGGTGAGATAAAGCCAAAG 
AGTAGGAGAAGGTGGGGTCTTATTTCCAGGTCAACAAAGGATTCAGATGATTGGGCTGAC 
TTGGATGACTTGGATTTCAGTCCATCCCTCAGCAGGATTGACCTGAAAAACAAGAAAAGA 
CAGAGTGATGACACTCTCTGCAGGTTTGAGAGTGTTTTGGACCTGAAGCCCTCTGAGCCT 
GTGGGCA CAGGAAACAGTGCCCCCACCCAGACGTCATATCAGCGGCGAGACACGCCCACC 
CTGAGATCTGCAGCCAAGCAGCACTATTTGAAGCACTCTCGATACTTGCCTGGGATCAGT 
ATAAGAAATGGCATACTCTCGAATCCAGGCAAGGAATTTATTCCACCTAATCCATGGTCT 
AGTTCTGGCTTGTCTGGAAAATCTTCAGGGACAATGTCAGTAATCAGCAAAGTAAATTCA 
GTTGGTTCCAGCTCTACAAGTTCTAGTGGACTGACTGGAAACTATGTCCCTTCCTTTCTG 
AAAAAAGAAATCGGTTCTGCTATGCAGAGGGTACACCTAGCACCTATTCCAGACCCTTCC 
CCTGGTTATTCCTCCCTGT^AGGCCATGAGACCTCATCCTGGGCGACCATTCTTGGACACC 
CAGCCTAGAAGCACTCCTGGGTTGATACCACGGCCTCCAGCCGCCCAGCCAGTGCATGGC 
CGGACAGACTGGGCTTCCAAGTACCCATCCCGGCGGTGA 
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SEQ ID NO: 59_AA83 994 0_M 

AGCAGCAACAATGGTGGCATGAGTGCAGAGGAGGAGATAGGGCCTGGGGCTGAGCCTATG 
AGAGGACCAAGCTTGGCTACAAGGGACTGGAGAGATGAGACTGTTGGGACCACAGACCTG 
CAGCAAGGCATAGACCCAGGAGCAGTGAGCCCTGAGCCTGGGAAGGACCACGCAGCCCAG 
GGCCCAGGAAGAACTGAAGCTGGAAGGGTATCTTCTGCTGCAGAGGCTGCCATTGTGGTT 
CTAGATGACAGCGCAGCACCCCCAGCCCCTTTTGAACACCGGGTAGTGAGCATCAAAGAT 
ACCCTGATCTCAGCAGGCTACACGGTATCCCAACATGAAGTCTTAGGAGGGGGTCGGTTT 
GGCCAGGTGCACAGGTGTACAGAGAGGTCTACAGGCCTTGCACTGGCAGCCAAGATCATC 
AAAGTGAAGAACGTAAAGGACCGGGAGGATGTGAAGAATGAGGTCAACATCATGAACCAG 
CTCAGCCACGTAAACTTGATCCAACTTTATGATGCGTTTGAGAGCAAGAACAGCTTCACT 
CTGATCATGGAGTATGTGGATGGAGGCGAACTCTTTGACCGGATCACGGATGAGAAGTAC 
CACCTCACTGAGTTGGATGTGGTCTTGTTCACGAGGCAGATCTGTGAGGGTGTGCATTAC 
CTGCATCAGCACTATATCCTGCACCTGGACCTCAAGCCTGAGAACATATTGTGTGTCAGC 
CAGACAGGGCATCAAATTAAGATCATTGACTTTGGGCTGGCTAGAAGATACAAGCCTCGG 
GAGAAGCTAAAGGTGAACTTTGGTACTCCGGAGTTCCTGGCCCCAGAAGTTGTTAACTAT 
GAGTTTGTGTCATTTCCAACAGACATGTGGAGTGTGGGAGTTATCACCTACATGCTACTC 
AGTGGTTTGTCCCCATTTCTAGGGGAGACAGATGCAGAGACCATGAATTTTATTGTGAAC 
TGCAGCTGGGATTTCGATGCTGATACCTTCAAAGGGCTGTCGGAGGAAGCCAAGGACTTT 
GTTTCCCGGTTACTGGTCAAAGAGAAGAGCTGTAGGATGAGCGCCACACAGTGCCTGAAA 
CACGAGTGGTTAAATCACCTGCCTGCCAAAGCCTCGGGCTCCAACGTTCGCCTCAGATCC 
CAACAACTGCTGCAGAAATATATGGCTCAGAGTAAATGGAAGAAACATTTCCACGTGGTG 
GCTGCAGTCAACAGGCTACGGAAATTTCCAACGTGTCCCTAATCTTCAACTCTGGTGTTC 
CACTGGGCCTGGGAATTCTTGAGGCAACACGAAGTGGTAATATGAAGAGATTACTCAAGA 
TTTTATGTAGATTGGCGCTTTGCTATTATTGATTTTTCTTATTTTGCAAAGAATGATGGA 
AGGAAGCAAGAAAGAAAGAAAAGAAGAAAAGGGGGAAGAAAAGGAAAAGGCAGAAAGCAA 
GGAAACAGGCTACGTTGTTGCTCTTCTTGTAGGTGAAAGTGTTTTTATTAAAAGCCCTAG 
GAATGTTTTTCTGCCTCGTAAGGTCAGCAGGTCTCATATGCTGCTTGCTACCCCGCACCC 
TTCCTTTTGGTAATAAGAGCAGGCACGCTCAGGATGGGCAGGGAAATCCTACTTGGCTTT 
TGGTCAAATTTGAATTCTAAACTTGTCATGATTAAAGAAGCCAGTAGGGAGGGAGGTATG 
GAAGAGGGAGGAATTAGGTCCAACAGTGGGGGATGAATTTGACCGAAACATTGTATAAAA 
TTCTTAAAGAATTAATAAAATATATTTTTAAAGGAG 

SEQ ID NO: 60_AA460132_H 

GGAACCTCAGGCTTCAGAGAGCCGAAAAGTTGGGAGGCGTAACCACTTACAGGCCGGAAG 
TGTCCGGGGTGGACGCATTCGGGTAGCCGAAGAAGTCCCAGGATTGCCGAAGAAGTCCCA 
GGATTTCCGAAGCGAGCCGAAGCATCGCGACAGTTTTCAGAGACAGCTGATCGGTTGGAG 
CTGTTGCGCCGAGCAGTCATGGCGGCGGCCAGAGCTACTACGCCGGCCGATGGCGAGGAG 
CCCGCCCCGGAGGCTGAGGCTCTGGCCGCAGCCCGGGAGCGGAGCAGCCGCTTCTTGAGC 
GGCCTGGAGCTGGTGAAGCAGGGTGCCGAGGCGCGCGTGTTCCGTGGCCGCTTCCAGGGC 
CGCGCGGCGGTGATCAAGCACCGCTTCCCCAAGGGCTACCGGCACCCGGCGCTGGAGGCG 
CGGCTTGGCAGACGGCGGACGGTGCAGGAGGCCCGGGCGCTCCTCCGCTGTCGCCGCGCT 
GGAATATCTGCCCCAGTTGTCTTTTTTGTGGACTATGCTTCCAACTGCTTATATATGGAA 
GAAATTGAAGGCTCAGTGACTGTTCGAGATTATATTCAGTCCACTATGGAGACTGAAAAA 
ACTCCCCAGGGTCTCTCCAACTTAGCCAAGACAATTGGGCAGGTTTTGGCTCGAATGCAC 
GATGAAGACCTCATTCATGGTGATCTCACCACCTCCAACATGCTCCTGAAACCCCCCCTG 
GAACAGCTGAACATTGTGCTCATAGACTTTGGGCTGAGTTTCATTTCAGCACTTCCAGAG 
GATTU^GGGAGTAGACCTCTATGTCCTGGAGAAGGCCTTCCTCAGTACCCATCCCAACACT 
GAAACTGTGTTTGAAGCCTTTCTGAAGAGCTACTCCACCTCCTCCAAAAAGGCCAGGCCA 
GTGCTAAAAAAATTAGATGAAGTGCGCCTGAGAGGAAGAAAGAGGTCCATGGTTGGGTAG 
AAGAATGTGTATGACAACCACACACAGTGAAGCTCTTTTTTCAAAGTAAATTTGAAGAAA 
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TGCTACAAGTATGAGATGAGATCTAAGTAAAGGTGTTAAGATATTTTTAAGTGGTATGTG 
ATCGTGTCATTATCATCTGCACTTCACTCAAGAGCTTACTATGTGTCTAAGTCATGTTCT 
AGGCAGAATTGGGTATTTAAAGTAAATTGAGGACAGGCTTCTCCCAGATTGTGACATGTA 
TATCTCAGATACATGGGTGTGGCATTGAACCACATAATGAGAACATTATTCTCTTTTTAG 
TCCTTGTGAGACAAGGATGAAGTCTCAGTTGCTGATACTCGCTGAGCTTACTGGCCCTCT 
AACCCAGTGTTTTTTTTTGTTGTTGTTGTGTACATGTTATATTTATTTTGAAACCAGTTT 
AATGGGATACAACCAGCATTTTAAAAAATGAAATAGAATACAGCATGG 

SEQ ID NO: 61_SGK034_H 

CAGAGAGAGAAGGTAAACCAAGGGAACATGCCAGGGCTTCAGAGCACCTTCCTAGCCATG 
GACACGGAGGAGGGGGTAGAGGTGGTGTGGAACGAGCTCCACTTCGGAGACAGGAAGGCC 
TTCGCGGCGCACGAGGAGAAGATCCAGACCGTGTTCGAGCAGCTGGTGCTGGTGGACCAC 
CCGAACATCGTGAAGTTGCACAAGTACTGGCTGGATACCTCTGAGGCCTGCGCGAGGGTC 
ATCTTCATCACAGAGTACGTGTCATCAGGCAGCCTCAAGCAATTCCTCAAAAAGACCAAG 
AAGAACCACAAGGCCATGAACGCCCGGGCCTGGAAGCGCTGGTGCACGCAGATCCTGTCT 
GCGCTCAGCTTCCTGCACGCCTGCAGCCCCCCAATCATCCACGGGAACCTGACCAGCGAC 
ACCATCTTCATTCAGCACAACGGCCTCATCAAGATCGGCTCCGTGTGGCACCGAATCTTC 
TCCAATGCACTTCCAGATGATCTCCGAAGCCCCATCCGCGCTGAGCGAGAGGAACTTCGG 
AACCTGCACTTCTTCCCCCCAGAGTATGGAGAGGTGGCCGATGGGACCGCTGTGGACATC 
TTCTCCTTTGGGATGTGTGCGCTGGAGATGGCTGTACTGGAAATCCAGACCAATGGGGAC 
ACCCGGGTCACAGAGGAGGCCATTGCTCGCGCCAGGCACTCGCTGAGTGACCCCAACATG 
CGGGAGTTCATCCTTTGCTGCCTGGCCCGGGACCCTGCCCGCCGGCCCTCTGCCCACAGC 
CTCCTCTTCCACCGCGTGCTCTTCGAGGTGCACTCGCTGAAGCTCCTGGCAGCCCACTGC 
TTCATCCAGCACCAGTACCTCATGCCTGAGAATGTGGTGGAGGAG7VAGACCAAGGCCATG 
GACCTGCACGCGGTCTTGGCGGAGCTTCCCCGGCCCCGCAGGCCCCCGCTGCAGTGGCGG 
TACTCGGAAGTCTCCTTCATGGAGCTGGACAAATTCCTGGAGGATGTCAGGAATGGAATC 
TACCCACTGATGAACTTTGCAGCCACTCGACCCCTGGGGCTGCCCCGTGTGCTGGCCCCA 
CCCCCGGAGGAGGTCCAAAAGGCCAAGACCCCGACGCCAGAGCCCTTTGACTCTGAGACC 
AGAAAGGTCATCCAGATGCAGTGCAACCTGGAGAGAAGCGAGGACAAGGCGCGCTGGCAT 
CTCACTCTGCTTCTGGTGCTGGAAGACCGGCTGCACCGGCAGCTGACCTACGACCTGCTC 
CCAACGGACAGCGCCCAGGACCTCGCCTCGGAGCTCGTGCACTATGGCTTCCTCCACGAG 
GACGACCGGATGAAGCTGGCCGCCTTCCTGGAGAGCACCTTCCTCAAGTACCGTGGGACC 
CAGGCCTGACCCGGAGCCCCAGCCCCAGGGGACCATGCCGGGGTGCTGCCCGGGCAGGCC 
ATGTTGGGGAGACTCCAGCACCGTGGGGCTGCCCTCCTCCATGCGCCTGGGAGCACAAAG 
GCCCCGGTAGTGAAGGAACCCCCCGTCTCCTGAGAGTGGGGCTGACCCTGCCTTGGGCGC 
CGAGGGGTTGGGGGGTGGGTGTGGGGGAGCCGTTAGGCCTCCCAGGTCCTTAGGATCAGG 
GTTGCCCCCAGAACCCCTTCCCATATCCTCCATTCTCCGCCCTGAGTTCCTACCCAGGCT 
GCCTGGCTGGGGCCACTGCCTCCTCAGCATGCAGGAGGCTGCCCTGTAGGGAACCCCAGC 
TCTGGGGCTTGGGGGTGAGGGTCAGCCCTGGACAGACCTCTGCCCAGGGAACTGCTCCAT 
GGGGTCTGGGAGAGCAGCCATCCCCTGCTGGCACCATAGACCCACACAAGGAGCCTGCAC 
AGCAAGCCAGCGGTGACACACCTGCAGGTGTCAGGCATGGCACTGGGCACAACAGGGACC 
TGGCAGGAGAAACAGACCACAGAGAGGTCTGGAGTTGAGGCTGTTGTCAGCAAAGCCCCT 
GGTCCCACACAGCTCTGCCCTAGAGCCACCTCTTTGACCCTTTACCCACCCTGAGACCAG 
AACTTGCAGCCCCTCTGCAGATCTCCTCTGGCCACTGCAGCCCCTCCAATGGGCTTTTTC 
TCTCATGCATTCCCTGGCCTGGAGGCGTCAGGGACCCCACATCCTCCCTGCTCCTCAGAC 
TCACAGCCCCTCCATGTTACCTCCCGCACCTCCTCCCTGGGGCAGCTGCTCCCTGGGCCT 
CTGAGGATGTCAGCTCCTGGCTCCCTGCCTCTCTCCCACTCCACTCCTGGCTCAGTCTTA 
GAGATTTCTATGCCCTCATGGATTCTACCCCTGCCTTCCTGGCCTCTTGATTCTTGGCTT 
GCCTCTCCTCCAATTCCAAACTTAGTGAAATGGCCTTAAGCATTTTAAACTGTATGTATA 
CATTAGCGCATTCATGCCTTTCTAAACGCATTTCAAATGTCAACCAGGAAGGCACACCAC 
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TGTATTAGTTTTATACTGCCGCTGTAAAATTTACCACAAACTTAGTGACTTAACACAAAT 
TTATTGCAATTCTGTAGGCTGGAAGTCTGACTATGGGTCTCACTGGACTAGAATCAAGGC 
TGGCAGGCTGCCTTCCTTCCTGGAGGTTCTAGGGGAGACTCTGTCTCCTGCTCCTTCAGG 
CTGCTGGCAGAATCCACATCCTTTCGGTGGCAGGGCCAAGGTCCCCACTTTCTTGCTGAC 
TGTAAACTAAGGCCACTTCCAGCTTGTAGAGGCTGCCTACATTCCTTGGCTCTTGGCCCC 
CTCCTCCATCTTCAGAGCTAGCAGGTTCAGTCTGTGTCACGAACCATTTCTCTGGTTCCC 
TGCAGACAGGAAAGGTTGTCCCTAAGGACTCATGAGATTAGGTTGGGCCCAGCCAGATAA 
TACATGATAATCTCCCTCCTCAAGGTTTTTAATATTAAACACATCTGCAGGACACATTTT 
GCCATGTAAACTAACATTCACTGGTTCCAGGGATTAAGGAATGAACCTCTTTTGTTGGGG 
AAGGGTGGCATTCTGCTGACCACAGCACTCCAACCAAAAGCCAAAAACCAAAGCAAGACT 
TACTAACGCATATCAAATAAATTAAAGGTACAAAATCGTGAATCTCAGTTATCTTAAATA 
TTCCAATACTATTTACAAAATTATTCAAATTCTCACGCCTTCCAACTCAAAATTAGCAAT 
CTAAAGTAATTTCCATATCCTAGATGGAAACCCTCATGCTAAACTGTCTGATTATGCATG 
GTTCTAAATGGTTTCAGTGGCAAATACATAACATTGTACTACTGATTAAACTGAACTTAA 

AAGC 

SEQ ID NO: 62_AA103218_M SGK034_M 

CCACGCGTCCGCACCAGAGTATGGCGAAGTCAATGATGGGACTGGCTTTGTGGACATCTT 

CTCCTTCGGGATGTGTGCACTGGAGATGGCTGTACTCGAGATCCAAGCCAACGGGGATAC 

CAGAGTCACAGAAGAGGCCATCGCTCGAGCCAGGCACTCACTGAGTGACCCCAACATGCG 

GGAATTCATCCTCTCCTGCCTGGCCCGGGACCCTGCCCGCCGACCCTCAGCCCACAACCT 

CCTCTTCCACCGAGTGCTCTTTGAGGTGCACTCGCTGAAGCTGCTGGCAGCTCACTGCTT 

CATCCAGCACCAGTACCTCATGCCTGAGAATGTGGTAGAGGAAAAGACCAAGGCCATGGA 

CCTCCATGCAGTTTTGGCTGAGATGCCGCAGCCCCATGGACCCCCAATGCAGTGGCGGTA 

CTCAGAGGTCTCCTTCTTGGAGCTGGACAAATTCCTAGAGGATGTCAGGAACGGGATCTA 

TCCACTGATGAACTTTGCGGCTGCTCGGCCCTTGGGGCTTCCCCGTGTGTTGGCCCCACC 

CCCAGAGGAAGCCCAAAAGGCCAAAACTCCAACGCCAGAACCCTTTGACTCGGAGACCAG 

GAAGGTGGTCCAGATGCAGTGCAACCTGGAAAGAAGCGAGGACAAGGCTCGGTGGCACCT 

TACTCTGCTCTTGGTGCTTGAGGACCGGCTACATCGGCAGCTGACCTATGATCTGCTCCC 

AACGGACAGTGCCCAGGACCTCGCTGCTGAACTAGTGCATTATGGCTTCCTGCACGAGGA 

TGACAGGACAAAGCTAGCAGCCTTTCTGGAGACCACTTTTCTCAAGTACCGAGGGACGCA 

AGCGTGACCTTCCCAGTCCTGACGGCCCAGCAGAGATACAGGGGCTCAGGGTTGTCCACT 

TGGCAAAGAGCCCCCACACTGCTCAAAGCTGCCTTCTGCCTGTGTTCCCTGGAACTGAAC 

ACAGGCCCTGCTAGTGAAGACACCCCCACCCCCCAGCTTTCTGCAGCAGTGTGGGACCCT 

GGGGTGGTGATGGAGCCCTGAGCCTGGACGAGAGTGGATACAGGTCAGTTAGGGGAACCG 

CTCCATCTGGTACTAGACAACAGCCATGCCTTCAGGTGGCATAGAAACCTAGGGAAGGAG 

CCTGAACTCAGGTGTCACAGTGCTGGGCATCAGGCAGACCAGACCTGACCTGATTGGAGA 

ACTGTAGACTAGATAGCTTGGAGTTGAACCCATGGCCAGGGAATTCCTTGGTCCTGCTCA 

GACCAGTCCTGATCCCTTGCAGACCTGCCTTGAGCCCTCTTTCTGATCTTCCACACTCTT 

GAGACCAGGACCTGTGTCCTCCCCAAAGCCCTTGGGAAGGATCTTTCTATTCATCATCCC 

TCTGGCCTAGGGGCTCAGGGGTCAGGCATCCTCCACATTCCCTCCCTGGGGAAGTTGTGT 

GTTTGAGTTGAGGATGTGGGTTCCTGGCTCCCTCTTTCTCCCCAGCCCAACTTGTCTCTT 

TCTTACTGGTTTCAAAGTCCTGATGAACGCTTCCCCTCAGAGCCACCCTGGTTTCCTTGG 

TTCTTGAACTGCCTCTCTCCCAACTTCAAACCAGGTCTTAAACGTTTTTTAAATGCATAT 

ATAAATGTAATGCAGTCACGGTCCTTTTTAAACACTTTGTGTATGAAACCAGGAAAGCTC 

ACTATTGTATTAGGAATAGTTCCACATTGCTGCTGTTAACAGATATCATAAACCCAGTGG 

TTTGAGACGACACACACACACACACACACACACACACACAGAGAGAGAGAGAGTTCTGTA 

CATCAAGTGTGATCCAGGCTCTCACTAGATTAATACCCAGGCTAAGTTCCTTTCTGGAAG 

CTGGGACTTACCTCCTGCTCCTTCAAGCTATTGGCAGAACTCACTTCCCTGCAATGGTAA 

GGCAGAAATCCCTATTTTCTCAACAGCTGCCAACTAAGAACCCCTCTCAGCTTCTAGAGG 
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CCACCAACTTTTCTTAGTTCTTCTTTCTCCCCCCTCAAGACCAGCAGCGTCAAGTTGAAT 
CTTTGTCCTGGGCTAGCTGACTGGCTTGCCACTGCTGGGAAGAGTTGGGGCCTTTTGTGA 
GTAGGTTGGACCCACCAGGATAACCGAGGATGATCCCCTTCTCAGGGTCTATAGATGAAC 
CACACCTGCGCAGTTCCTTCTGCTGTCATCCTGGGCTTTGGTGCTTGGAGAACAGCCGTG 
GGCGGTGGGTGTTGTTACTGTGGTACCTACCATGCCATCTTAACCGAAACCAAGACCTAA 
AATAAAACAGATTTGTCATGGGACATCTAATAAATTAAATGAACTCTG 

SEQ ID NO: 63_NEK7_H, N34132_H 

CACGAATCCGAGCCCGCTCGCCTCTCTCCAGCGAACCGACCATGTCTGGCGGCGCCGCAG 

AGAAGCAGAGCAGCACTCCCGGTTCCCTGTTCCTCTCGCCGCCGGCTCCTGCCCCCAAGA 

ACGGCTCCAGCTCCGATTCCTCCGTGGGGGAGAAACTGGGAGCCGCGGCCGCCGACGCTG 

TGACCGGCAGGACCGAGGAGTACAGGCGCCGCCGCCACACTATGGACAAGGACAGCCGTG 

GGGCGGCCGCGACCACTACCACCACTGAGCACCGCTTCTTCCGCCGGAGCGTCATCTGCG 

ACTCCAATGCCACTGCACTGGAGCTTCCCGGCCTTCCTCTTTCCCTGCCCCAGCCCAGCA 

TCCCCGCGGCTGTCCCGCAGAGTGCTCCACCGGAGCCCCACCGGGAAGAGACCGTGACCG 

CCACCGCCACTTCCCAGGTAGCCCAGCAGCCTCCAGCCGCTGCCGCCCCTGGGGAACAGG 

CCGTCGCGGGCCCTGCCCCCTCGACTGTCCCCAGCAGTACCAGCAAAGACCGCCCAGTGT 

CCCAGCCTAGCCTTGTGGGGAGCAAAGAGGAGCCGCCGCCGGCGAGAAGTGGCAGCGGCG 

GCGGCAGCGCCAAGGAGCCACAGGAGGAACGGAGCCAGCAGCAGGATGATATCGAAGAGC 

TGGAGACCAAGGCCGTGGGAATGTCTAACGATGGCCGCTTTCTCAAGTTTGACATCGAAA 

TCGGCAGAGGCTCCTTTAAGACGGTCTACAAAGGTCTGGACACTGAAACCACCGTGGAAG 

TCGCCTGGTGTGAACTGCAGGATCGAAAATTAACAAAGTCTGAGAGGCAGAGATTTAAAG 

AAGAAGCTGAAATGTTAAAAGGTCTTCAGCATCCCAATATTGTTAGATTTTATGATTCCT 

GGGAATCCACAGTAAAAGGAAAGAAGTGCATTGTTTTGGTGACTGAACTTATGACGTCTG 

GAACACTTAAAACGTATCTGAAAAGGTTTAAAGTGATGAAGATCAAAGTTCTAAGAAGCT 

GGTGCCGTCAGATCCTTAAAGGTCTTCAGTTTCTTCATACTCGAACTCCACTTATCATTC 

ACCGCGATCTTAAATGTGACAACATCTTTATCACCGGCCCTACTGGCTCAGTCAAGATTG 

GAGACCTCGGTCTGGCAACCCTGAAGCGGGCTTCTTTTGCCAAGAGTGTGATAGGTACCC 

CAGAGTTCATGGCCCCTGAGATGTATGAGGAGAAATATGATGAATCCGTTGACGTTTATG 

CTTTTGGGATGTGCATGCTTGAGATGGCTACATCTGAATATCCTTACTCGGAGTGCCAAA 

ATGCTGCGCAGATCTACCGTCGCGTGACCAGTGGGGTGAAGCCAGCCAGTTTTGACAAAG 

TAGCAATTCCTGAAGTGAAGGAAATTATTGAAGGATGCATACGACAAAACAAAGATGAAA 

GATATTCCATCAAAGACCTTTTGAACCATGCCTTCTTCCAAGAGGAAACAGGAGTACGGG 

TAGAATTAGCAGAAGAAGATGATGGAGAAAAAATAGCCATAAAATTATGGCTACGTATTG 

AAGATATTAAGAAATTAAAGGGAAAATACAAAGATAATGAAGCTATTGAGTTTTGTTTTG 

ATTTAGAGAGAGATGTCCCAGAAGATGTTGCACAAGAAATGGTAGAGTCTGGGTATGTCT 

GTGAAGGTGATCACAAGACCATGGCTAAAGCTATCAAAGACAGAGTATCATTAATTAAGA 

GGAAACGAGAGCAGCGGCAGTTGGTACGGGAGGAGCAAGAAAACAAAAAGCAGGAAGAGA 

GCAGTCTCAAACAGCAGGTAGAACAATCCAGTGCTTCCCAGACAGGAATCAAGCAGCTCC 

CTTCTGCTAGCACCGGCATACCTACTGCTTCTACCACTTCAGCTTCAGTTTCTACACAAG 

TAGAACCTGAAGAACCTGAGGCAGATCAACATCAACAACTACAGTACCAGCAACCCAGTA 

TATCTGTGTTATCTGATGGGACGGTTGACAGTGGTCAGGGATCCTCTGTCTTCACAGAAT 

CTCGAGTGAGCAGCCAACAGACAGTTTCATATGGGTTCCCAANNCATGAACAGGCACATT 

CTACAGGCACAGTCCCAGGGCATATACCTTCTACTGTCCAAGCACAGTCTCAGCCCCATG 

GGGTATATCCACCCTCAAGTGTGCAGCAGGGAATACAGCAGACAGCCCCTCCTCAACAGA 

CAGTGCAGTATTCACTTTCACAGACATCAACCTCCAGTGAGGCCACTACTGCACAGCCAG 

TGAGTCAGCCTCAAGCTCCACAAGTCTTGCCTCAAGTATCAGCTGGAAAACAGAGTACTC 

AGGGAGTCTCTCAGGTTGCTCCTGCAGAGCCAGTTGCAGTAGCACAGCCCCAAGCTACCC 

AGCCGACCACTTTGGCTTCCTCTGTAGACAGTGCACATTCAGATGTTGCTTCAGGTATGA 

GTGATGGCAATGAGAACGTCCCATCTTCCAGTGGAAGGCATGAAGGAAGAACTACAAAAC 
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GGC^TTACCGAAAATCTGTAAGGAGTCGCTCTCGACATGAAAAAACTTCACGCCCAAAAT 

TAAGAATTTTGAATGTTTCAAATAAAGGAGACCGAGTAGTAGAATGTCAATTAGAGACTC 

ATAATAGGAAAATGGTTACATTCAAATTTGACCTAGATGGTGACAACCCCGAGGAGATAG 

CAACAATTATGGTGAACAATGACTTTATTCTAGCAATAGAGAGAGAGTCGTTTGTGGATC 

AAGTGCGAGAAATTATTGAAAAAGCTGATGAAATGCTCAGTGAGGATGTCAGTGTGGAAC 

CAGAGGGTGATCAGGGATTGGAGAGTCTACAAGGAAAGGATGACTATGGCTTTTCAGGTT 

CTCAGAAATTGGAAGGAGAGTTCAAACAACCAATTCCTGCGTCTTCCATGCCACAGCAAA 

TAGGCATTCCTACCAGTTCTTTAACTCAAGTTGTTCATTCTGCGGGAAGGCGGTTTATAG 

TGAGTCCTGTGCCAGAAAGCCGATTACGAGAATCAAAAGTTTTCCCCAGTGAAATAACAG 

ATACAGTTGCTGCCTCTACAGCTCAGAGCCCTGGAATGAACTTGTCTCACTCTGCATCAT 

CCCTTAGTCTACAACAGGCCTTTTCTGAACTTAGACGTGCCCAAATGACAGAAGGACCCA 

ATACAGCACCTCCAAACTTTAGTCATACAGGACCAACATTTCCAGTAGTACCTCCTTTCT 

TAAGTAGCATTGCTGGAGTCCCAACCACAGCAGCAGCCACAGCACCAGTCCCTGCAACAA 

GCAGCCCTCCTAATGACATTTCCACATCAGTAATTCAGTCTGAGGTTACAGTGCCCACTG 

AAGAGGGGATTGCTGGAGTTGCCACCAGCACAGGTGTGGTAACTTCAGGTGGTCTCCCCA 

TACCACCTGTGTCTGAATCACCAGTACTTTCCAGCGTAGTTTCAAGTATCACAATACCTG 

CAGTTGTCTCAATATCTACTACATCCCCGTCACTTCAAGTCCCCACATCCACATCTGAGA 

TCGTTGTTTCTAGTACAGGACTGTATCCTTCAGTAACAGTTTCAGCAACTTCAGCCTCTG 

CAGGGGGCAGTACTGCTACCCCAGGTCCTAAGCCTCCAGCTGTAGTATCTCAGCAGGCAG 

CAGGCAGCACTACTGTGGGAGCCACATTAACATCAGTTTCTACCACCACTTCATTCCCAA 

GCACAGCTTCACAGCTGTCCATTCAGCTTAGCAGCAGTACTTCTACTCCTACTTTAGCTG 

AAACCGTGGTAGTTAGCGCACACTCACTAGATAAGACATCTCATAGCAGTACAACTGGAT 

TGGCTTTCTCCCTCTCTGCACCATCTTCCTCTTCCTCTCCTGGAGCAGGAGTGTCTAGTT 

ATATTTCTCAGCCTGGTGGGCTGCATCCTTTGGTCATTCCATCAGTGATAGCTTCTACTC 

CTATTCTTCCCCAAGCAGCAGGACCTACTTCTACACCTTTATTACCCCAAGTACCTAGTA 

TCCCACCCTTGGTACAGCCTGTTGCCAATGTGCCTGCTGTACAGCAGACACTAATTCATA 

GTCAGCCTCAACCAGCTTTGCTTCCCAACCAGCCCCATACTCATTGTCCTGAAGTAGATT 

rTGATACACAACCCAAAGCTCCTGGAATTGATGACATAAAGACTCTAGAAGAAAAGCTGC 

GGTCTCTGTTCAGTGAACACAGCTCATCTGGAGCTCAGCATGCCTCTGTCTCACTGGAGA 

CCTCACTAGTCATAGAGAGCACTGTCACACCAGGCATCCCAACTACTGCTGTTGCACCAA 

GCAAACTCCTGACTTCTACCACAAGTACTTGCTTACCACCAACCAATTTACCACTAGGAA 

CAGTTGCTTTGCCAGTTACACCAGTGGTCACACCTGGGCAAGTTTCTACCCCAGTCAGCA 

CTACTACATCAGGAGTGAAACCTGGAACTGCTCCCTCCAAGCCACCTCTAACTAAGGCTC 

CGGTGCTGCCAGTGGGTACTGAACTTCCAGCAGGTACTCTACCCAGCGAGCAGCTGCCAC 

CTTTTCCAGGACCTTCTCTAACCCAGTCCCAGCAACCTCTAGAGGATCTTGATGCTCAAT 

TGAGAAGAACACTTAGTCCAGAGATGATCACAGTGACTTCTGCGGTTGGTCCTGTGTCCA 

TGGCGGCTCCAACAGCAATCACAGAAGCAGGAACACAGCCTCAGAAGGGTGTTTCTCAAG 

TCAAAGAAGGCCCTGTCCTAGCAACTAGTTCAGGAGCTGGTGTTTTTAAGATGGGACGAT 

TTCAGGTTTCTGTTGCAGCAGACGGTGCCCAGAAAGAGGGTAAAAATAAGTCAGAAGATG 

CAAAGTCTGTTCATTTTGAATCCAGCACCTCAGAGTCCTCAGTGCTATCAAGTAGTAGTC 

CAGAGAGTACCTTGGTGAAACCAGAGCCGAATGGCATAACCATCCCTGGTATCTCTTCAG 

Sgtcc?Lagagtgcccacaaaactactgcctcagaggcaaagtcagacactg 

CTACCAAGGTTGGACGTTTTCAGGTGACAACTACAGCAAACAAAGTGGGTCGTTTCTCTG 
TATCAAAAACTGAGGACAAGATCACTGACACAAAGAAAGAAGGACCAGTGGCATCTCCTC 

CTTTTATGGATTTGGAACAAGCTGTTCTTCCTGCT 

AACTGTCAGAGCCTTCACATCTAAATGGGCCGTCTTCTGACCCGGAGGCCGCTTTTTTAA 

SSgggatgtggatgatggttccggtagtccacactcgccccatcagctgagctcaaaga 

gotcctaLcagaatctaagtcaaagccttagtaattcattta^ 

gtagcgacaatgagtcagatatcgaagatgaagacttaaagttagagctgcgacgactac 

Sgataaacatctcaaagagattcaggacctgcagagtcgccagaagcatgaaattgaat 
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CTTTGTATACCAAACTGGGCAAGGTGCCCCCTGCTGTTATTATTCCCCCAGCTGCTCCCC 

TTTCAGGGAGAAGACGACGACCCACTAAAAGCAAAGGCAGCAAATCTAGTCGAAGCAGTT 

CCTTGGGGAATAAAAGCCCCCAGCTTTCAGGTAACCTGTCTGGTCAGAGTGCAGCTTCAG 

TCTTGCACCCCCAGCAGACCCTCCACCCTCCTGGCAACATCCCAGAGTCCGGGCAGAATC 

AGCTGTTACAGCCCCTTAAGCCATCTCCCTCCAGTGACAACCTCTATTCAGCCTTCACCA 

GTGATGGTGCCATTTCAGTACCAAGCCTTTCTGCTCCAGGTCAAGGTAATAAAGCAACCA 

TCATCGTCCAAAAACAATAAAATGGAGATGTTGCCATACCTGGGACAAAAGCCTGTTAAG 

GCGGGTTGGGAGACTAGCTGACCAGAACACAGCCTGTGTGTTGTACACTGAAGAATCTGG 

GTGAAAAGGGAAGTGGAGTGATAATGAGAATCGGTGGGCTCACTGCTCCCATTAGGTGAA 

ATTACTTTTTTTCAAGGAATTACAGTGAAAAGTTACATCTGTGTGGCCTATATGACTTGC 

TCATTTGGGATTTGGAACTTAGGCTTTAATATTAGGCTGAGATTTCCTGGATGAAATTCT 

AAGGTGTTTTAGCAGTTTCTGAAGCTAATACATTTTCTTAGCCATTGTAGAATTTTGTTA 

CTTTTAAGTATGGGAGTGGCATACTAAAATGAATAACCTTACAATTCAGTTTTTTATCCA 

TAATCTACTTTCCAAATATAGCTCTGTTTATTAGTGATTGCTGAAAAAATTCCCACAGAG 

GAAAGAGCTTTTAGTCATATTAGAACAAGAATTGAAAAGACTTGGGCATCTGGGTGAGAA 

GAATGAAAAAAATATAGGTACTGGCTTATGTGCCTTTGCCACAGTTTCACAGAAATTAGA 

GATCAGTCTCTTCACAGGAAGAATGCACTTGATTGGTAAGGAGGGCAAACTAGCTAGCAT 

TATTCGAACTAAGAAAAGCTTCCGCATTTTGCAGATGGGTAGAATTAAGACCTAATATTT 

CATCTCTTACATATCTGACCTTCCCCCCAGAAGCTTGTTCTTCTGTGTGCCATCTTAGTG 

CATTTCACCACTCCAGCCTCAAGTTTCTAACATCTTGTAGTTGTGTTCTGTCTCTTCTCC 

TCTCTCTGTTCTACCCTGTTTTTCCCCTCTCACAGGCTGTGCGAAGTTTAACTGTGCATC 

TGAACAGGTGACATTCAAACCTGGTGGCAGGAGGACCCGATTTCTGAGTACGCCCTGCTT 

GGCTCTTTGTGTGTAACACCTTTACTCCTTCCTTGTCCTTGTGTTTCTGCTGCTTGGATC 

TGATGTTTCACGCAGTCCATTTTCATTTGTCTCTTTTTGTATATCATCTACTCAGTGGCT 

TGGCTGAATTACTGTTACCCTCAGAAGTTTGGGCCCCCACATTAATTATGATAAAAAATG 

TCAAAATAACAAGTTATCTACAAATTTCAATGTAACTTTCTGGTAGAAGTGCTTCTTCAT 

GGATCTGTGACAGAGAGTGGATATGGTATCTAGGCAATAGATTGCTGGGTCATTTAGAAT 

AATGAAGACTGAACTCCACAGTCGTAGTCAGTGCTGTCTGTCTGCCCTAGCATTAGAAAT 

GAGAGAAATCAGCCAGACACGGTGGCGTACACCTGTAATCCCAGCACTTTGGGAGGCCGA 

GGCGGGAAGATTGCTTGAGGCCAGGAGCTCGAGACCAACCCTGGGCAACATGGTGATACC 

CCATCTCT 

SEQ ID NO: 64_BCON3_H 

GCGGAGCGCAGCTGTGAGGGAGTCGCTGTGATCCGGGGCCCCGGAACCCGAGCTGGAGCT 
GAAGCGCAGGCTGCGGGGCGCGGAGTCGGGAGGCCTGAGTGTTCCTTCCAGCATGTCGGA 
GGGGGAGTCCCAGACAGTACTTAGCAGTGGCTCAGACCCAAAGGTAGAATCCTCATCTTC 
AGCTCCTGGCCTGACATCAGTGTCACCTCCTGTGACCTCCACAACCTCAGCTGCTTCCCC 
AGAGGAAGAAGAAGAAAGTGAAGATGAGTCTGAGATTTTGGAAGAGTCGCCCTGTGGGCG 
CTGGCAGAAGAGGCGAGAAGAGGTGAATCAACGGAATGTACCAGGTATTGACAGTGCATA 
CCTGGCCATGGATACAGAGGAAGGTGTAGAGGTTGTGTGGAATGAGGTACAGTTCTCTGA 
ACGCAAGAACTACAAGCTGCAGGAGGAAAAGGTTCGTGCTGTGTTTGATAATCTGATTCA 
ATTGGAGCATCTTAACATTGTTAAGTTTCACAAATATTGGGCTGACATTAAAGAGAACAA 
GGCCAGGGTCATTTTTATCACAGAATACATGTCATCTGGGAGTCTGAAGCAATTTCTGAA 
GAAGACCAAAAAGAACCACAAGACGATGAATGAAAAGGCATGGAAGCGTTGGTGCACACA 
AATCCTCTCTGCCCTAAGCTACCTGCACTCCTGTGACCCCCCCATCATCCATGGGAACCT 
GACCTGTGACACCATCTTCATCCAGCACAACGGACTCATCAAGATTGGCTCTGTGGCTCC 
TGACACTATCAACAATCATGTGAAGACTTGTCGAGAAGAGCAGAAGAATCTACACTTCTT 
TGCACCAGAGTATGGAGAAGTCACTAATGTGACAACAGCAGTGGACATCTACTCCTTTGG 
CATGTGTGCACTGGAGATGGCAGTGCTGGAGATTCAGGGCAATGGAGAGTCCTCATATGT 
GCCACAGGAAGCCATCAGCAGTGCCATCCAGCTTCTAGAAGACCCATTACAGAGGGAGTT 
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CATTCAAAAGTGCCTGCAGTCTGA^ 
CCACCCAGCATTGTTTGAAGTGCCCTCGCTCAAA^ 

ACACCAACACATGATCCCAGAGAACGC^ 
TGCCGTACTGGCTGAAATCCCTGCAGGACCAGGAA^ 

TCAGTCACCAGCTCTGGAATTAGA^ 
TCTGACAGCCTTTGGGCTGCCTCGGCCCCAGCAGCCAC^ 

TGTCGTGCCCCCCTCTGTCAAGACTCCGACACCT^ 



CCAGAGCCGGTTGACTTCTCTGCTAGAAGAGACCTTGAA^C 
CAGTACCCTCAACTCAGCCGCTGTCACCGTCTCCTCTTA^ 

TGATCTGCGCTGTGGCTGTCCCTGG^ 

AGTATTACCCTGTGAAGCCCCTTCCCTCC111A TCT GCACTTTGTTTACT 
TGGTTCTGAGCATCATCCTTTCCCCTCCCCTCTCTTCC^ 

^^^^^^^^^ 

GCT 

SEQ ID NO: 65_AA711829 M GACTAT GAATGAAAAGGCTTGG 

M(^CCTACACTTTTTTGCACCRGMTSTGGftGMGTCS TT CAGGGCAAT 
GACATCTACTCCTTTGGCATGTGTGCACTGGAGATGGCAGTGCT CTAGAMAC 
^CGAGTCCTCATATGTGC«CAG^GCa^G^TGCCATCC 

GAGGAGGTGACATCACCTGTTGTGCCCCCCT^ 

GAAGTGGAGACACGAAAGGTGGTGCTGAT^ 

GTCAAACACCATCTAACACTTCTGCTGAAGCTGGAG^ 

TGTGACCTGATGCCAAATGAGAGCATC^ 
TTCATTAGTGAGGCTGATCAGAGCCGCCTGACTTCT 

TTTTTGCT 
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SEQ ID NO: 66_AA099102_H 

ATGTCATCATGTGTCTCTAGCCAGCCCAGCAGCAACCGGGCCGCCCCCCAGGATGAGCTG 

GGGGGCAGGGGCAGCAGCAGCAGCGAAAGCCAGAAGCCCTGTGAGGCCCTGCGGGGCCTC 

TCATCCTTGAGCATCCACCTGGGCATGGAGTCCTTCATTGTGGTCACCGAGTGTGAGCCG 

GGCTGTGCTGTGGACCTCGGCTTGGCGCGGGACCGGCCCCTGGAGGCCGATGGCCAAGAG 

GTCCCCCTTGACACCTCCGGGTCCCAGGCCCGGCCCCACCTCTCCGGTCGCAAGCTGTCT 

CTGCAAGAGCGGTCCCAGGGTGGGCTGGCAGCCGGTGGCAGCCTGGACATGAACGGACGC 

TGCATCTGCCCGTCCCTGCCCTACTCACCCGTCAGCTCCCCGCAGTCCTCGCCTCGGCTG 

CCCCGGCGGCCGACAGTGGAGTCTCACCACGTCTCCATCACGGGTATGCAGGACTGTGTG 

CAGCTGAATCAGTATACCCTGAAGGATGAAATTGGAAAGGGCTCCTATGGTGTCGTCAAG 

TTGGCCTACAATGAAAATGACAATACCTACTATGCAATGAAGGTGCTGTCCAAAAAGAAG 

CTGATCCGGCAGGCCGCTTTTCCACGTCGCCCTCCACCCCGAGGCACCCGGCCAGCTCCT 

GGAGGCTGCATCCAGCCCAGGGGCCCCATTGAGCAGGTGTACCAGGAAATTGCCATCCTC 

AAGAAGCTGGACCACCCCAATGTGGTGAAGCTGGTGGAGGTCCTGGATGACCCCAATGAG 

GACCATCTGTACATGGTGTTCGAACTGGTCAACCAAGGGCCCGTGATGGAAGTGCCCACC 

CTCAAACCACTCTCTGAAGACCAGGCCCGTTTCTACTTCCAGGATCTGATCAAAGGCATC 

GAGTACTTACACTACCAGAAGATCATCCACCGTGACATCAAACCTTCCAACCTCCTGGTC 

GGAGAAGATGGGCACATCAAGATCGCTGACTTTGGTGTGAGCAATGAATTCAAGGGCAGT 

GACGCGCTCCTCTCCAACTACGTGGGCACGCCCGCCTTCATGGCTCCCGAGTCGCTCTCT 

GAGACCCGCAAGATCTTCTCTGGGAAGGCCAAGGATGTTTGGGCCATGGGTGTGACACTA 

TACTGCTTTGTCTTTGGCCAGTGCCCATTCATGGACGAGCGGATCATGTGTTTACACAGT 

AAGATCAAGAGTCAGGCCCTGGAATTTCCAGACCAGCCCGACATAGCTGAGGACTTGAAG 

GACCTGATCACCCGTATGCTGGACAAGAACCCCGAGTCGAGGATCGTGGTGCCGGAAATC 

AAGCTGCACCCCTGGGTCACGAGGCATGGGGCGGAGCCGTTGCCGTCGGAGGATGAGAAC 

TGCACGCTGGTCGAAGTGACTGAAGAGGAGGTCGAGAACTCAGTCAAACACATTCCCAGC 

TTGGCAACCGTGATCCTGGTGAAGACCATGATACGTAAACGCTCCTTTGGGAACCCATTC 

GAGGGCAGCCGGCGGGAGGAACGCTCACTGTCAGCGCCTGGAAACTTGCTCACCAAAAAA 

CCAACCAGGGAATGTGAGTCCCTGTCTGAGCTCAAGGAAGCAAGGCAGCGAAGACAACCT 

CCAGGGCACCGACCCGCCCCCCGTGGGGGAGGAGGAAGTGCTCTTGTGAGAGGCAGTCCC 

TGCGTGGAAAGTTGCTGGGCCCCCGCCCCCGGCTCCCCCGCACGCATGCATCCACTGCGG 

CCGGAGGAGGCCATGGAGCCCGAGTAG 

SEO ID NO: 67 5R69 17 2 H ' „„ 

CCGGGATGTGAGCCTGGTGGTTGGCAGCTGGAGCCACGTCGGAGGGGGAAGTGTCGCAGC 

ATTCTCTGCAGGCATCACAGACCTGAGGCAGTGGCCTCCGGAGGGCACTGGACAGAAACA 

GCCATCCAAGTGGCTGAGTGGAGGGACCCTGCTCAAGTGCAGCTGCAGTGGCCGGGGTTT 

CCCTCAGGTAGGGATCGGGGCGCCTTGTCGCCGCCAGCCACGTGTGGCGTCCGGTACAGT 

CAGCAGAGTGCAGGGTGCGGGCACCAGGAAAGGGGGCGCAGGGGAACTCCCGCGGGCCTC 

rCGTTTGCAAACTTCTCGCCTGGGCAGGAGGCGGTCGTGGGAAAGAAGGTGGAAGAGCGA 

GCTTTTTGGAACTGTGCACGGGACAGATTGGACGCACACCCCTCGGGAGGCGCGAAGGCA 

TGGAAAATTTGAAGCATATTATCACCCTTGGCCAGGTCATCCACAAACGGTGTGAAGAGA 

TGAAATACTGCAAGAAACAGTGCCGGCGCCTGGGCCACCGCGTCCTCGGCCTGATCAAGC 

CTCTGGAGATGCTCCAGGACCAAGGAAAGAGGAGCGTGCCCTCTGAGAAGTTAACCACAG 

CCATGAACCGCTTCAAGGCTGCCCTGGAGGAGGCTAATGGGGAGATAGAAAAGTTCAGCA 

ATAGATCCAATATCTGCAGGTTTCTAACAGCAAGCCAGGACAAAATACTCTTCAAGGACG 

TGAACAGGAAGCTGAGTGATGTCTGGAAGGAGCTCTCGCTGTTACTTCAGGTTGAGCAAC 

GCATGCCTGTTTCACCCATAAGCCAAGGAGCGTCCTGGGCACAGGAAGATCAGCAGGATG 

Sgacgaagacaggcgagctttccagatgctaagaagagataatgaaaaaatagaagctt 
SSgagacgattagaaatcaacatgaaagaaatc^ 

tatcatgtgccctgctgtttctgatggcccccaaactagaagtcatcagtttactgggac 
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CCCAGCCTCCCGCTACCCCTGCATTTGTCCATTTTCTGTGCTGGATGGCTGGAAGCAGCC 

CACAGGTTTGGGGATCCATTCATGGCTAGCCCAGGCTTCTGTCCATGGAATAACATGTGG 

AGAGAGCTTCTTGACCAGTAAGATACCTTCTAGCAGCTGTCAAAGTACTTAAAAACCTCT 

ATGAATAGAATCAAAGCTTCAGTTCAGTTGCTGAATTTCCAAGAAGAAATTCAAATCAAA 

TTTAAAATGCCCACTCATTCATTCATTCAACAAAACTGTGAGTATCTGGTTTATGCCAGA 

GGCCATGCAAAGAGGTAACTAAGATGCAGAGAAGGACACTGCCTTCCAGGAGCTCACGGG 

GTGGAGGAGGAAAGAGGAAAGACAGACAGTGAACACACAACAGCAAGGTTACTGAGCTTG 

AACTATGTCCCTAACTACTAGATCTGAAATGACTACGCCAGATGCCAGATGCTCAAGTGC 

CAAGCTCTGGGTAACAGGAATAGACATCCTTCCAGGATGAGAGAGATGAGTCTGGATGAG 

GGTTAAGGCTGGAGGGACAGGCGGGATTTGAAGAGGAGGGAAAGGAAGTGGATGACACAT 

TCTGTTAACTGTCCAGCTGTGTCTCTACTGGTCACTCAGAGGCACGGGAGCCGCTCCCTT 

GGGCTGAGTCCATCAGAAGCCCCAGCCACCACCAGCTCTGGTTCATGTAGTAGAGCTTCC 

CACTCACACATCACAAATATGCCACCTCCCTTAGGACCCCTTCCTCTGCTCATTGACTCT 

TTTGTCTTCTTTCCTCTCGGGGGTGAGGTCAGATTTACCACCAAAATGCATGCAGGAGAT 

CCCGCAAGAGCAAATCAAGGAGATCAAGAAGGAGCAGCTTTCAGGATCCCCGTGGATTCT 

GCTAAGGGAAAATGAAGTCAGCACACTTTATAAAGGAGAATACCACAGAGCTCCAGTGGC 

CATAAAAGTATTCAAAAAACTCCAGGCTGGCAGCATTGCAATAGTGAGGCAGACTTTCAA 

TAAGGAGATCAAAACCATGAAGAAATTCGAATCTCCCAACATCCTGCGTATATTTGGGAT 

TTGCATTGATGAAACAGTGACTCCGCCTCAATTCTCCATTGTCATGGAGTACTGTGAACT 

CGGGACCCTGAGGGAGCTGTTGGATAGGGAAAAAGACCTCACACTTGGCAAGCGCATGGT 

CCTAGTCCTGGGGGCAGCCCGAGGCCTATACCGGCTAGACCATTCAGAAGCACCTGAACT 

CCACGGAAAAATCAGAAGCTCAAACTTCCTGGTAACTCAAGGCTACCAAGTGAAGCTTGC 

AGGATTTGAGTTGAGGAAAACACAGACTTCCATGAGTTTGGGAACTACGAGAGAAAAGAC 

AGACAGAGTCAAATCTACAGCATATCTCTCACCTCAGGAACTGGAAGATGTATTTTATCA 

ATATGATGTAAAGTCTGAAATATACAGCTTTGGAATCGTCCTCTGGGAAATCGCCACTGG 

AGATATCCCGTTTCAAGGTGAAGAATGTGAAGACTGGCTCAGCCAGTGGCTGTAATTCTG 

AGAAGATCCGCAAGCTGGTGGCTGTGAAGCGGCAGCAGGAGCCACTGGGTGAAGACTGCC 

CTTCAGAGCTGCGGGAGATCATTGATGAGTGCCGGGCCCATGATCCCTCTGTGCGGCCCT 

CTGTGGATGAAATCTTAAAGAAACTCTCCACCTTTTCTAAGTAGTGTATCAAAATCTAAA 

CCAAGGAGTCTCTGGACAAGAAGCTGGGAGAGGCACGAACTGGACATCTCTCTCTCTCAT 

ATCCTTCGGCATTGGGTTATCTATGGGTGCAAGGAGTGGGCACGCTTCTCTGTTACAAAT 

AGAAAACGATTCCAGTCATACAGGACACATCCCACTCCAAATGATATTTCCAAAAACATA 

CCTCTGACAGTAACTTTGATAGATGGTTTGTCAAATGTATCTTTCTGGGTATCCACACCT 

CTTGGCAATGAAATTTGCAGCTCCTCCCTTCCATAAATGAAGTCTCTTTCCCCACCATTT 

GAATCTGGGCTGGCACTGTGACTTGATTTGATCAATAGAATGTGGAAGAAGTGACTGTAT 

GCCAGTTCCAAGCCTAGGTTTCAAGAGGCCTTATAAATGTCTGTTGGAACCTTACCCAGC 

CATGGACATGTTGAGTGAGCATGCTGGAGAATGAGAGACCACATGAAGCAGAAACATGCT 

TTCCTAGCTGAAGTCATACTAGCCCAACCAACATGGCAGCTAACACATGAATGAGGCCAA 

TCAAGACCAGAAGAACCACTCAAGCAGATCCCAGCCCAAATTGCCCATTCACACAATCAG 

GAGCTAAATAAATTACTGTTGTCTTTT 
SEO ID NO: 68 H85811_H 

CGCCCGGCCCCCTCCCCCGGCGCCGGCCACGGGAGGCGGTGATGCGGGCGCGGGCGGCCT 
CGGCTGCGCCGAGAGCGGAGACACAGGCTCAAGATGGCAGATTCCGACTGAGGCTGGGGG 
GGCCGAGCTCGCGCGCCGCTTTCCCGTCCCCGTTGCCATGAACCGCGGACACCCCGGCCC 
CGATGGCCCCCGTGTACGAAGGTATGGCCTCACATGTGCAAGTTTTCTCCCCTCACACCC 
TTCAATCAAGTGCCTTCTGTAGTGTGAAGAAACTGAAAATAGAGCCGAGTTCCAACTGGG 
ACATGACTGGGTACGGCTCCCACAGCAAAGTGTATAGCCAGAGCAAGAACATCCCCCTGT 
CGCAGCCAGCCACCACAACCGTCAGCACCTCCTTGCCGGTCCCAAACCCAAGCCTACCTT 
ACGAGCAGACCATCGTCTTCCCAGGAAGCACCGGGCACATCGTGGTCACCTCAGCAAGCA 
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GCACTTCTGTCACCGGGCAAGTCCTCGGCGGACCACACAACCTAATGCGTCGAAGCACTG 

TGAGCCTCCTTGATACCTACCAAAAATGTGGACTCAAGCGTAAGAGCGAGGAGATCGAGA 

ACACAAGCAGCGTGCAGATCATCGAGGAGCATCCACCCATGATTCAGAATAATGCAAGCG 

GGGCCACTGTCGCCACTGCCACCACGTCTACTGCCACCTCCAAAAACAGCGGCTCCAACA 

GCGAGGGCGACTATCAGCTGGTGCAGCATGAGGTACTGTGCTCCATGACCAACACCTACG 

AGGTCTTAGAGTTCTTGGGCCGAGGGACGTTTGGGCAAGTGGTCAAGTGCTGGAAACGGG 

GCACCAATGAGATCGTAGCCATCAAGATCCTGAAGAACCACCCATCCTATGCCCGACAAG 

GTCAGATTGAAGTGAGCATCCTGGCCCGGTTGAGCACGGAGAGTGCCGATGACTATAACT 

TCGTCCGGGCCTACGAATGCTTCCAGCACAAGAACCACACGTGCTTGGTCTTCGAGATGT 

TGGAGCAGAACCTCTATGACTTTCTGAAGCAAAACAAGTTTAGCCCCTTGCCCCTCAAAT 

ACATTCGCCCAGTTCTCCAGCAGGTAGCCACAGCCCTGATGAAACTCAAAAGCCTAGGTC 

TTATCCACGCTGACCTCAAACCAGAGAACATCATGCTGGTGGATCCATCTAGACAACCAT 

ACAGAGTCAAGGTCATCGACTTTGGTTCAGCCAGCCACGTCTCCAAGGCTGTGTGCTCCA 

CCTACTTGCAGTCCAGATATTACAGGGCCCCTGAGATCATCCTTGGTTTACCATTTTGTG 

AGGCAATTGACATGTGGTCCCTGGGCTGTGTTATTGCAGAATTGTTCCTGGGTTGGCCGT 

TATATCCAGGAGATTCGGAGTATGATCAGATTCGGTATATTTCACAAACACAGGGTTTGC 

CTGCTGAATATTTATTAAGCGCCGGGACAAAGACAACTAGGTTTTTCAACCGTGACACGG 

ArTCACCATATCCTTTGTGGAGACTGAAGACACCAGATGACCATGAAGCAGAGACAGGGA 

TTAAGTCAAAAGAAGCAAGAAAGTACATTTTCAACTGTTTAGATGATATGGCCCAGGTGA 

ACATGACGACAGATTTGGAAGGGAGCGACATGTTGGTAGAAAAGGCTGACCGGCGGGAGT 

TCATTGACCTGTTGAAGAAGATGCTGACCATTGATGCTGACAAGAGAATCACTCCAATCG 

AAACCCTGAACCATCCCTTTGTCACCATGACACACTTACTCGATTTTCCCCACAGCACAC 

ACGTCAAATCATGTTTCCAGAACATGGAGATCTGCAAGCGTCGGGTGAATATGTATGACA 

CGGTGAACCAGAGCAAAACCCCTTTCATCACGCACGTGGCCCCCAGCACGTCCACCAACC 

TGACCATGACCTTTAACAACCAGCTGACCACTGTCCACAACCAGCCCTCAGCGGCATCCA 

TGGCTGCAGTGGCCCAGCGGAGCATGCCCCTGCAGACAGGAACAGCCCAGATTTGTGCCC 

GGCCTGACCCGTTCCAGCAAGCTCTCATCGTGTGTCCCCCCGGCTTCCAAGGCTTGCAGG 

CCTCTCCCTCTAAGCACGCTGGCTACTCGGTGCGAATGGAAAATGCAGTTCCCATCGTCA 

CTCAAGCCCCAGGAGCTCAGCCTCTTCAGATCCAACCAGGTCTGCTTGCCCAGCAGGCTT 

GGCCAAGTGGGACCCAGCAGATCCTGCTTCCCCCAGCATGGCAGCAACTGACTGGAGTGG 

CCACCCACACCTCAGTGCAGCATGCCACCGTGATTCCCGAGACCATGGCAGGCACCCAGC 

AGCTGGCGGACTGGAGAAATACGCATGCTCACGGAAGCCATTATAATCCCATCATGCAGC 

AGCCTGCACTATTGACCGGTCATGTGACCCTTCCAGCAGCACAGCCCTTAAATGTGGGTG 

TGGCCCACGTGATGCGGCAGCAGCCAACCAGCACCACCTCCTCCCGGAAGAGTAAGCAGC 

ACCAGTCATCTGTGAGAAATGTCTCCACCTGTGAGGTGTCCTCCTCTCAGGCCATCAGCT 

CCCCACAGCGATCCAAGCGTGTCAAGGAGAACACACCTCCCCGCTGTGCCATGGTGCACA 

GTAGCCCGGCCTGCAGCACCTCGGTCACCTGTGGGTGGGGCGACGTGGCCTCCAGCACCA 

CCCGGGAACGGCAGCGGCAGACAATTGTCATTCCCGACACTCCCAGCCCCACGGTCAGCG 

TCATCACCATCAGCAGTGACACGGACGAGGAGGAGGAACAGAAACACGCCCCCACCAGCA 

CTGTCTCCAAGCAAAGAAAAAACGTCATCAGCTGTGTCACAGTCCACGACTCCCCCTACT 

CCGACTCCTCCAGCAACACCAGCCCCTACTCCGTGCAGCAGCGTGCTGGGCACAACAATG 

rCAATGCCTTTGACACCAAGGGGAGCCTGGAGAATCACTGCACGGGGAACCCCCGAACCA 

TCATCGTGCCACCCCTGAAAACCCAGGCCAGCGAAGTATTGGTGGAGTGTGATAGCCTGG 

TGCCAGTCAACACCAGTCACCACTCGTCCTCCTACAAGTCCAAGTCCTCCAGCAACGTGA 

rrTCCACCAGCGGTCACTCTTCAGGGAGCTCATCTGGAGCCATCACCTACCGGCAGCAGC 

GGCCGGGCCCCCACTTCCAGCAGCAGCAGCCACTCAATCTCAGCCAGGCTCAGCAGCACA 

TCACCACGGACCGCACTGGGAGCCACCGAAGGCAGCAGGCCTACATCACTCCCACCATGG 

CCCAGGCTCCGTACTCCTTCCCGCACAACAGCCCCAGCCACGGCACTGTGCACCCGCATC 

TGGCTGCAGCCGCTGCCGCTGCCCACCTCCCCACCCAGCCCCACCTCTACACCTACACTG 

CGCCGGCGGCCCTGGGCTCCACCGGCACCGTGGCCCACCTGGTGGCCTCGCAAGGCTCTG 
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CGCGCCACACCGTGCAGCACACTGCCTACCCAGCCAGCATCGTCCACCAGGTCCCCGTGA 
GCATGGGCCCCCGGGTCCTGCCCTCGCCCACCATCCACCCGAGTCAGTATCCAGCCCAAT 
TTGCCCACCAGACCTACATCAGCGCCTCGCCAGCCTCCACCGTCTACACTGGATACCCAC 
TGAGCCCCGCCAAGGTCAACCAGTACCCTTACATATAAACACTGGAGGGGAGGGAGGGAG 
GGAGGGAGGGAGAGAATGGCCCGAGGGAGGAGGGAGAGAAGGAGGGAGGCGCTCCTGGGA 
CCGTGGGCGCTGGCCTTTTATACTGAAGATGCCGCACACAAACAATGCAAACGGGGCAGG 
GGCGGGGGGGGGGGGGCAGAGGGCAGGGGGACGGGTCGGGACACCAGTGAAACTTGAACC 
GGGAAGTGGGAGGACGTAGAGCAGAGAAGAGAACATTTTTAAAAGGAAGGGATTAAAGAG 

GGTGGGAAATCTATGGTTTTTATTTTAAAAAAG 
SEO ID NO: 69 DYRK3 H 

CGGGAGCGAAAGTGCGCTGAGCTGCAGTGTCTGGTCGAGAGTACCCGTGGGAGCGTCGCG 
CCGCGGAGGCAGCCGTCCCGGCGTAGGTGGCGTGGCCGACCGGACCCCCAACTGGCGCCT 
CTCCCCGAGCGGGGTCCCGAGCTAGGAGATGGGAGGCACAGCTCGTGGGCCTGGGCGGAA 
CGATGCGGGGCCGCCTGGGGCCGGGCTCCCGCCCCAGCAGCGGAGTTGGGGGATGGTGTC 

AATCCTTCTGAACCACCTCCACCCAGAAGACTAAATATGACCGCTGAGCAGTTTACAGGA 
CATCATACTCAGCACTTTTTGGATGGAGGTGAGATGAAGGTAGAACAGCTGTTTCAAGAA 
TTTGGCAACAGAAAATCCAATACTATTCAGTCAGATGGCATCAGTGACTCTGAAAAATGC 
TrTCCTACTGTTTCTCAGGGTAAAAGTTCAGATTGCTTGAATACAGTAAAATCCAACAGT 
TCATCCAAGGCACCCAAAGTGGTGCCTCTGACTCCAGAACAAGCCCTGAAGCT^TATAAA 
CACCACCTCACTGCCTATGAGAAACTGGAAATAATTAATTATCCAGAAATTTACTTTGTA 
GGTCCAAATGCCAAGAAAAGACATGGAGTTATTGGTGGTCCCAATAATGGAGGGTATGAT 
GATGCAGATGGGGCCTATATTCATGTACCTCGAGACCATCTAGCTTATCGATATGAGGTG 

ctgaaaattattggcaaggggagttttgggcaggtggccagggtctatgatcacaaactt 

CGACAGTACGTGGCCCTAAAAATGGTGCGCAATGAGAAGCGCTTTCATCGTCAAGCAGCT 
GAGGAGATCCGGATTTTGGAGCATCTTAAGAAACAGGATAAAACTGGTAGTATGAACGTT 

ScSSSSggaaagtttcacattccggaacca 

AGCATAGACCTTTATGAGCTGATTAAAAAAAATAAGTTTCAGGGTTTTAGCGTCCAGTTG 
GTACGCA^GTTTGCCCAGTCCATCTTGCAATCTTTGGATGCCCTCCACAAAAATAAGATT 
ATTCACTGCGATCTGAAGCCAGAAAACATTCTCCTGAAACACCACGGGCGCAGTTCAACC 
AAGGTCATTGACTTTGGGTCCAGCTGTTTCGAGTACCAGAAGCTCTACACATATATCCAG 
TCTCGGTTCTACAGAGCTCCAGAAATCATCTTAGGAAGCCGCTACAGCACACCAATTGAC 

ISSSgttttcgctgcatccttgcagaacttttaacaggacagcctctcttccctgga 
gIggatgaaggagaccagttggcctgcatgatggagcttctagggatgccaccaccaaa^ 

Stctggagcaatccaaacgtgccaagtactttattaattcc 
?gSSSgactacccaggcagatgggagggttgtgcttgtggggggtcgctcacgtagg 

ggtaaaaagcggggtcccccaggcagcaaagactgggggacagcactgaaagggtgtgat 

gS^gtttatagagttcttgaaaaggtgtcttcactgggacccctctgccc 

acScagctcaagcattaagacacccttggattagcaagtctgtccccagacctc 

accaSgLaaggtgtcagggaaacgggtagttaatcctgcaagtgctttccaggga^^^ 

rGTTCTAAGCTGCCTCCAGTTGTTGGAATAGCCAATAAGCTTAAAGCTAACTTAATGTCA 

gaaaccaatggtagtatacccctatgcagtgtattgccaaaactgattagctagtggaca 

GAGATATGCCCAGAGATGCATATGTGTATATTTTTATGATCTTACAAACCTGCAAATGGA 

Saaatgcaagcccattggtggatgtttttgttagagtagactttttttaaacaagacaa 

AAC^TTTT^TATGATTATAAAAGAATTCTTCAAGGGCTAATTACCTAACCAGCTTGTAT 
TGGCCATCTGGAATATGCATTAAATGACTTTTTATAGGTCA 
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SEQ ID NO: 70_AA58 924 1_M DYRK3_M 

CCACGCGTCCGGAGTTGCTAGGAATGCCACCGCAGAAACTTCTGGAGCAATCCAAGCGTG 
CCAAGTACTTTATTAACTCCAAAGGCTTGCCTCGATACTGCTCCGTATCTACCCAGACGG 
ACGGGAGGGTGGTGCTTCTCGGGGGTCGCTCACGCAGGGGTAAAAAGCGAGGCCCGCCAG 
GCAGCAAAGACTGGGCAACCGCACTGAAGGGCTGTGGTGACTACTTGTTCATAGAGTTTC 
TGAAACGATGCCTCCAGTGGGACCCCTCTGCCCGCCTCACCCCGGCTCAAGCATTAAGAC 
ATCCTTGGATTAGCT^AGTCTACACCCAAACCTCTCACCATGGACAAGGTGCCAGGGAAGC 
GGGTAGTTAACCCTACAAATGCTTTCCAGGGACTGGGTTCCAAGCTGCCTCCAGTCGTTG 
GGATAGCCAGTAAGCTTAAAGCTAACCTAATGTCCGAAACCAGTGGTAGTATACCTCTGT 
GCAGTGTATTGCCAAAGCTGATTAGCTAGTGGACCACTCAGAGACTGATACATATCATAT 
GTATTTTTAATTACCTTGCAAACATGCAAATGGAT^AACGGAATAATTGAAGCCCATTCAC 
TGATGGATATGTTTTTGTTAGACTTTTTTTTAACAAGGCAGAACATTTTTATATGACTAT 
AAAAGAACGCTTCAAGGGCTAATGTCAAACCAGCTTGTATTGGCCATCTGGAGTATACAT 
TAAATGACTTTTTCATAGGTC 

SEQ ID NO: 71_5R72_16_2_H 

GTCGAGGCGCAGCGCTGCCATGGCTGGGGGCCGTGGGGCCCCCGGGCGCGGCCGGGACGA 
GCCTCCGGAGAGCTACCCGCAACGACAGGACCACGAGCTACAGGCCCTGGAGGCCATCTA 
CGGCGCGGACTTCCAAGACCTGCGGCCGGACGCTTGCGGACCGGTCAAAGAGCCCCCTGA 
AATCAATTTAGTTTTGTACCCTCAAGGCCTAACTGGTGAAGAAGTATATGTAAAAGTGGA 
TTTGAGGGTTAAATGCCCACCTACCTATCCAGATGTAGTTCCTGAAATAGAGTTAAAAAA 
TGCCAAAGGTCTATCAAATGAAAGTGTCAATTTGTTAAAATCTCGCCTAGAAGAACTGGC 
CAAGAAACACTGTGGGGAGGTGATGATCTTTGAACTGGCTTACCACGTGCAGTCATTTCT 
CAGCGAGCATAACAAGCCCCCTCCCAAGTCTTTTCATGAAGAAATGCTGGAAAGGCGGGC 
TCAGGAGGAGCAGCAGAGGCTGTTGGAGGCCAAGCGGAAAGAAGAGCAGGAGCAACGTGA 
AATCCTGCATGAGATTCAGAGAAGGAAAGAAGAGATAAAAGAAGAGAAAAAAAGGAAAGA 
AATGGCTAAGCAGGAACGTTTGGAAATTGCTAGTTTGTCAAACCAAGATCATACCTCTAA 
GAAGGACCCAGGAGGACACAGAACGGCTGCCATTCTACATGGAGGCTCTCCTGACTTTGT 
AGGAAATGGTAAACATCGGGCAAACTCCTCAGGT^AGGTCTAGGCGAGAACGTCAGTATTC 
TGTATGTT^ATAGTGAAGATTCTCCTGGCTCTTGTGAAATTCTGTATTTCAATATGGGGAG 
TCCTGATCAGCTCATGGTGCACAAAGGGAAATGTATTGGCAGTGATGAACAACTTGGAAA 
ATTAGTCTACAATGCTTTGGAAACAGCCACTGGTGGCTTTGTCTTGTTGTATGAGTGGGT 
CCTTCAGTGGCAGA7^AAAATGGGTCCATTCCTTACCAGTCAAGAAAAAGAGAAGATTGA 
TAAGTGCAAAAAGCAGATTCAAGGAACAGAAACAGAATTCAACTCACTGGTAAAATTGAG 
CCATCCAAATGTAGTACGCTACCTTGCAATGAATCTCAAAGAGCAAGACGACTCCATCGT 
GGTGGACATTTTAGTGGAGCACATTAGTGGGGTCTCTCTTGCTGCACACCTGAGCCACTC 
AGGCCCCATCCCTGTGCATCAGCTTCGCAGGTACACAGCTCAGCTCCTGTCAGGCCTTGA 
TTATCTGCACAGCAATTCTGTGGTGCATAAGGTCCTGAGTGCATCTAATGTCTTGGTGGA 
TGCAGAAGGCACCGTCAAGATTACGGACTATAGCATTTCTAAGCGCCTCGCAGACATTTG 
CAAGGAGGATGTGTTTGAGCAAACCCGAGTTCGTTTTAGTGACAATGCTCTGCCTTATAA 
AACGGGGAAGAAAGGAGATGTTTGGCGTCTTGGCCTTCTGCTGCTGTCCCTCAGCCAAGG 
ACAGGAATGTGGAGAGTACCCTGTGACCATCCCTAGTGACTTACCAGCTGACTTTCAAGA 
TTTTCTAAAGAAATGTGTGTGCTTGGATGACAAGGAAAGATGGAGTCCCCAGCAGTTGTT 
GAAACACAGCTTTATAAATCCCCAGCCAAAAATGCCTCTAGTGGAACAAAGTCCTGAAGA 
TTCTGGAGGACAAGATTATGTTGAGACTGTTATTCCTAGCAACCGGCTACCCAGTGCTGC 
CTTCTTTAGTGAGACACAGAGACAGTTTTCCCGATACTTCATTGAGTTTGAAGAATTACA 
ACTTCTTGGTAAAGGAGCTTTTGGAGCTGTCATCAAGGTGCAGAACAAGTTGGACGGCTG 
CTGCTACGCAGTGAAGCGCATCCCCATCAACCCGGCCAGCCGGCAGTTCCGCAGGATCAA 
GGGCGAAGTGACACTGCTGTCACGGCTGCACCATGAGAACATTGTGCGCTACTACAACGC 
CTGGATCGAGCGGCACGAGCGGCCGGCGGGACCGGGGACGCCGCCCCCGGACTCCGGGCC 
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CCTGGCCAAGGATGACCGAGCTGCACGCGGGCAGCCGGCGAGCGACACAGACGGCCTGGA 

CAGCGTAGAGGCCGCCGCGCCGCCACCCATCCTCAGCAGCTCGGTGGAGTGGAGCACTTC 

GGGCGAGCGCTCGGCCAGTGCCCGTTTCCCCGCCACCGGCCCGGGCTCCAGCGATGACGA 

GGACGACGACGAGGACGAGCACGGTGGCGTCTTCTCCCAGTCCTTCCTGCCTGCTTCAGA 

TTCTGAAAGTGATATTATCTTTGACAATGAAGATGAGAACAGTAAAAGTCAGAATCAGGA 

TGAAGATTGCAATGAAAAGAATGGCTGCCATGAAAGTGAGCCATCAGTGACGACTGAGGC 

TGTGCACTACCTATACATCCAGATGGAGTACTGTGAGAAGAGCACTTTACGAGACACCAT 

TGACCAGGGACTGTATCGAGACACCGTCAGACTCTGGAGGCTTTTTCGAGAGATTCTGGA 

TGGATTAGCTTATATCCATGAGAAAGGAATGATTCACCGGGATTTGAAGCCTGTCAACAT 

TTTTTTGGATTCTGATGACCATGTGAAAATAGGTGATTTTGGTTTGGCGACAGACCATCT 

AGCCTTTTCTGCTGACAGCAAACAAGACGATCAGACAGGAGACTTGATTAAGTCAGACCC 

TTCAGGTCACTTAACTGGGATGGTTGGCACTGCTCTCTATGTAAGCCCAGAGGTCCAAGG 

AAGCACCAAATCTGCATACAACCAGAAAGTGGATCTCTTCAGCCTGGGAATTATCTTCTT 

TGAGATGTCCTATCACCCCATGGTCACGGCTTCAGAAAGGATCTTTGTTCTCAACCAACT 

CAGAGATCCCACTTCGCCTAAGTTTCCAGAAGACTTTGACGATGGAGAGCATGCAAAGCA 

GAAATCAGTCATCTCCTGGCTGTTGAACCACGATCCAGCAAAACGGCCCACAGCCACAGA 

GCTGCTCAAGAGTGAGCTGCTGCCCCCACCCCAGATGGAGGAGTCAGAGCTGCATGAAGT 

GCTGCACCACACGCTGACCAACGTGGATGGGAAGGCCTACCGCACCATGATGGCCCAGAT 

CTTCTCGCAGCGCATCTCCCCTGCCATCGATTACACCTATGACAGCGACATACTGAAGGG 

CAACTTCTCAATCCGTACAGCCAAGATGCAGCAGCATGTGTGTGAAACCATCATCCGCAT 

CTTTAAAAGACATGGAGCTGTTCAGTTGTGTACTCCACTACTGCTTCCCCGAAACAGACA 

AATATATGAGCACAACGAAGCTGCCCTATTCATGGACCACAGCGGGATGCTGGTGATGCT 

TCCTTTTGACCTGCGGATCCCTTTTGCAAGATATGTGGCAAGAAATAATATATTGAATTT 

AAAACGATACTGCATAGAACGTGTGTTCAGGCCGCGCAAGTTAGATCGATTTCATCCCAA 

AGAACTTCTGGAGTGTGCATTTGATATTGTCACTTCTACCACCAACAGCTTTCTGCCCAC 

TGCTGAAATTATCTACACTATCTATGAAATCATCCAAGAGTTTCCAGCACTTCAGGAAAG 

AAATTACAGTATTTATTTGAACCATACCATGTTATTGAAAGCAATACTCTTACACTGTGG 

GATCCCAGAAGATAAACTCAGTCAAGTCTACATTATTCTGTATGATGCTGTGACAGAGAA 

GCTGACGAGGAGAGAAGTGGAAGCTAAATTTTGTAATCTGTCTTTGTCTTCTAATAGTCT 

GTGTCGACTCTACAAGTTTATTGAACAGAAGGGAGATTTGCAAGATCTTATGCCAACAAT 

AAATTCATTAATAAAACAGAAAACAGGTATTGCACAGTTGGTGAAGTATGGCTTAAAAGA 

CCTAGAGGAGGTTGTTGGACTGTTGAAGAAACTCGGCATCAAGTTACAGGTCTTGATCAA 

TTTGGGCTTGGTTTACAAGGTGCAGCAGCACAATGGAATCATCTTCCAGTTTGTGGCTTT 

CATCAAACGAAGGCAAAGGGCTGTACCTGAAATCCTCGCAGCTGGAGGCAGATATGACCT 

GCTGATTCCCCAGTTTAGAGGGCCACAAGCTCTGGGGCCAGTTCCCACTGCCATTGGGGT 

CAGCATAGCTATAGACAAGATATCTGCTGCTGTCCTCAACATGGAGGAATCTGTTACAAT 

AAGCTCTTGTGACCTCCTGGTTGTAAGTGTTGGTCAGATGTCTATGTCCAGGGCCATCAA 

CCTAACCCAGAAACTCTGGACAGCAGGCATCACAGCAGAAATCATGTACGACTGGTCACA 

GTCCCAAGAGGAATTACAAGAGTACTGCAGACATCATGAAATCACCTATGTGGCCCTTGT 

CTCGGATAAAGAAGGAAGCCATGTCAAGGTTAAGTCTTTCGAGAAGGAAAGGCAGACAGA 

GAAGCGTGTGCTGGAGACTGAACTTGTGGACCATGTACTGCAGAAACTGAGGACTAAAGT 

CACTGATGAAAGGAATGGCAGAGAAGCTTCCGATAATCTTGCAGTGCAAAATCTGAAGGG 

GTCATTTTCTAATGCTTCAGGTTTGTTTGAAATCCATGGAGCAACAGTGGTTCCCATTGT 

GAGTGTGCTAGCCCCGGAGAAGCTGTCAGCCAGCACTAGGAGGCGCTATGAAACTCAGGT 

ACAAACTCGACTTCAGACCTCCCTTGCCAACTTACATCAGAAAAGCAGTGAAATTGAAAT 

TCTGGCTGTGGATCTACCCAAAGAAACAATATTACAGTTTTTATCATTAGAGTGGGATGC 

TGATGAACAGGCATTTAACACAACTGTGAAGCAGCTGCTGTCACGCCTGCCAAAGCAAAG 

ATACCTCAAATTAGTCTGTGATGAAATTTATAACATCAAAGTAGAAAAAAAGGTGTCTGT 

GCTATTTCTGTACAGCTATAGAGATGACTACTACAGAATCTTATTTTAACCCTAAAGAAC 

TGTCGTTAACCTCATTCAAACAGACAGAGGCTTATACTGGAATAATGGAATGTTGTACAT 
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TCATCATAATTTAAAATTAAATTCTAAGAAGAGGCTGGGTGCAGTGGCTCACACCTTTAA 
TCCCAGCACTTTGGGAAGCCAAGGCAGGAAGACTGCTTGAAACCAGGAGTTTGAGACCAG 

CCT 

SEQ ID NO: 73_R43524_H, HRI_H 

ATGCTGGGGGGCAACTCCGGGGTCCGCAAGCGCGAAGAGGAGGGCGACGGGGCTGGGGCT 
GTGGCTGCGCCGCCGGCCATCGACTTTCCCGCCGAGGGCCCGGACCCCGAATATGACGAA 
TCTGATGTTCCAGCAGAAATCCAGGTGTTAAAAGAACCCCTACAACAGCCAACCTTCCCT 
TTTGCAGTTGCAAACCAACTCTTGCTGGTTTCTTTGCTGGAGCACTTGAGCCACGTGCAT 
GAACCAAACCCACTTCGTTCAAGACAGGTGTTTAAGCTACTTTGCCAGACGTTTATCAAA 
ATGGGGCTGTTGTCTTCTTTCACTTGTAGTGACGAGTTTAGCTCATTGAGACTACATCAC 
AACAGAGCTATTACACACTTAATGAGGTCTGCTAAAGAGAGAGTTCGTCAGGATCCTTGT 
GAGGATATTTCTCGTATCCAGAAAATCAGATCAAGGGAAGTAGCCTTGGAAGCACAAACT 
TCACGTTACTTAAATGAATTTGAAGAACTTGTCATCTTAGGAAAAGGTGGATACGGAAGA 
GTATACAAGGTCAGGAATAAATTAGATGGTCAGTATTATGCAATAAAAAAAATCCTGATT 
AAGGGTGCAACTAAAACAGTTTGCATGAAGGTCCTACGGGAAGTGAAGGTGCTGGCAGGT 
CTTCAGCACCCCAATATTGTTGGCTATCACACCGCGTGGATAGAACATGTTCATGTGATT 
CAGCCACGAGCAGACAGAGCTGCCATTGAGTTGCCATCTCTGGAAGTGCTCTCCGACCAG 
GAAGAGGACAGAGAGCAATGTGGTGTTAAAAATGATGAAAGTAGCAGCTCATCCATTATC 
TTTGCTGAGCCCACCCCAGAAAAAGAAAAACGCTTTGGAGAATCTGACACTGAAAATCAG 
AATAACAAGTCGGTGAAGTACACCACCAATTTAGTCATAAGAGAATCTGGTGAACTTGAG 
TCGACCCTGGAGCTCCAGGAAAATGGCTTGGCTGGTTTGTCTGCCAGTTCAATTGTGGAA 
CAGCAGCTGCCACTCAGGCGTAATTCCCACCTAGAGGAGAGTTTCACATCCACCGAAGAA 



TCTTCCG/~ 

CACATCCAGATGCAGCTGTGTGAGCTCTCGCTGTGGGATTGGATAGTCGAGAGAAACAAG 
CGGGGCCGGGAGTATGTGGACGAGTCTGCCTGTCCTTATGTTATGGCCAATGTTGCAACA 
AAAATTTTTCAAGAATTGGTAGAAGGTGTGTTTTACATACATAACATGGGAATTGTGCAC 
CGAGATCTGAAGCCAAGAAATATTTTTCTTCATGGCCCTGATCAGCAAGTAAAAATAGGA 
GACTTTGGTCTGGCCTGCACAGACATCCTACAGAAGAACACAGACTGGACCAACAGAAAC 
GGGAAGAGAACACCAACACATACGTCCAGAGTGGGTACTTGTCTGTACGCTTCACCCGAA 
CAGTTGGAAGGATCTGAGTATGATGCCAAGTCAGATATGTACAGCTTGGGTGTGGTCCTG 
CTAGAGCTCTTTCAGCCGTTTGGAACAGAAATGGAGCGAGCAGAAGTTCTAACAGGTTTA 
AGAACTGGTCAGTTGCCGGAATCCCTCCGTAAAAGGTGTCCAGTGCAAGCCAAGTATATC 
CAGCACTTAACGAGAAGGAACTCATCGCAGAGACCATCTGCCATTCAGCTGCTGCAGAGT 
GAACTTTTCCAAAATTCTGGAAATGTTAACCTCACCCTACAGATGAAGATAATAGAGCAA 
GAAAAAGAAATTGCAGAACTAAAGAAGCAGCTAAACCTCCTTTCTCAAGACAAAGGGGTG 

AGGGATGACGGAAAGGATGGGGGCGTGGGATGA 
SEO ID NO: 74 17000057519457_H 

CACAAGAGCCCTTCCTGCAGGGAACCTCAGGCTTCAGAGAGCCGAAAAGTTGGGAGGCGT 
AACCACTTACAGGCCGGAAGTGTCCGGGGTGGACGCATTCGGGTAGCCGAAGAAGTCCCA 
GGATTGCCGAAGAAGTCCCAGGATTTCCGAAGCGAGCCGAAGCATCGCGACAGTTTTCAG 
AGACAGCTGATCGGTTGGAGCTGTTGCGCCGAGCAGTCATGGCGGCGGCCAGAGCTACTA 
CGCCGGCCGATGGCGAGGAGCCCGCCCCGGAGGCTGAGGCTCTGGCCGCAGCCCGGGAGC 
rr AGCAGCCGCTTCTTGAGCGGCCTGGAGCTGGTGAAGCAGGGTGCCGAGGCGCGCGTGT 
TCCGTGGCCGCTTCCAGGGCCGCGCGGCGGTGATCAAGCACCGCTTCCCCAAGGGCTACC 
GGCACCCGGCGCTGGAGGCGCGGCTTGGCAGACGGCGGACGGTGCAGGAGGCCCGGGCGC 
TCCTCCGCTGTCGCCGCGCTGGAATATCTGCCCCAGTTGTCTTTTTTGTGGACTATGCTT 
CCAACTGCTTATATATGGAAGAAATTGAAGGCTCAGTGACTGTTCGAGATTATATTCAGT 
CCACTATGGAGACTGAAAAAACTCCCCAGGGTCTCTCCAACTTAGCCAAGACAATTGGGC 
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AGGTTTTGGCTCGAATGCACGATGAAGACCTCATTCATGGTGATCTCACCACCTCCAACA 

TGCTCCTGAAACCCCCCCTGGAACAGCTGAACATTGTGCTCATAGACTTTGGGCTGAGTT 

TCATTTCAGCACTTCCAGAGGATAAGGGAGTAGACCTCTATGTCCTGGAGAAGGCCTTCC 

TCAGTACCCATCCCAACACTGAAACTGTGTTTGAAGCCTTTCTGAAGAGCTACTCCACCT 

CCTCCAAAAAGGCCAGGCCAGTGCTAAAAAAATTAGATGAAGTGCGCCTGAGAGGAAGAA 

AGAGGTCCATGGTTGGGTAGAAGAATGTGTATGACAACCACACACAGTGAAGCTCTTTTT 

TCAAAGTAAATTTGAAGAAATGCTACAAGTATGAGATGAGATCTAAGTAAAGGTGTTAAG 

ATATTTTTAAGTGGTATGTGATCGTGTCATTATCATCTGCACTTCACTCAAGAGCTTACT 

ATGTGTCTAAGTCATGTTCTAGGCAGAATTGGGTATTTAAAGTAAATTGAGGACAGGCTT 

CTCCCAGATTGTGACATGTATATCTCAGATACATGGGTGTGGCATTGAACCACATAATGA 

GAACATTATTCTCTTTTTAGTCCTTGTGAGACAAGGATGAAGTCTCAGTTGCTGATACTC 

GCTGAGCTTACTGGCCCTCTAACCCAGTGTTTTTTTTTGTTGTTGTTGTGTACATGTTAT 

ATTTATTTTGAAACCAGTTTAATGGGATACAACCAGCATTTTAAAAAATGAAATAGAATA 

CAGCATGGAAAATATCAGTGTATTGTTTTATGAAACTTTCACGTGTATATATAGACCAAG 

GATATGTGCTGAGTTTTGATGTCAAATATATTTCTCTTTCAGGGTCATGATCAAAAAATG 

AAAAGTCTGCTTAACTCCAATTTCTCTTTTAAAAAAGCAGACTTACAGCTTTCAGGCAAC 

TGAAATTCATGTTAACATGTTTTTATTTTTATTGCTTTGTATTTTTGTGGTTACCTTCTA 

AGACAAGTGATTGATCTAAAGT.TCCTTTTAAGTTTATACCGCTAAACAAACTGAGTTGAT 

TTCTATCACAGGCAGTAAGTAGGTAGAGCAAAAATGGTGAAGTGACTTGTGAAGACTGAA 

GTTTGATGAAGTCTGGTTTAAGGCACAGGTAAACTGAGTGTGGATGCAAAAGTACCAGGA 

GCTAGCTTTTAACCTTGCCCAGCCTCAGTTTCTTTTCTTAGAAGAAGCTATGTTTGGGTG 

GGAAGGGAAGAGAGGGATAAGAAAATACCTTTCTTCCTTGTAAACTCCAATCAACAAACA 

TATTTTGAGTGCCTTTTGTGTTCCTTGGCACCCTGTTGGGTATTGGGTACTTGGCACCCT 

GTTGGGTATTGGGTACAATGGTGAGCCAGACAGACACAGCGCCTGTCCTTTTGTAAGAAT 

ATTTATTTTTATAAAAAAGTATAAAGTATACAGTGGGATGTTTTGATATACATTATGAAA 

TGATTGCTACAGCTGAGCTAATTAACACCCATCACCTCACATAGTTACTGTCTTGTTTCT 

TAATATGGACATTTGCAGCTATGAATTTCCCTCTGCACACTGTTGTCATCACACACTCTC 

AGTTTTGGTATTTTGTGTTTTTGTTTTCATTCATCTCAAAGTATTTTCTAATTTCCCTTG 

TGATTTCTTCTTTGACCCCTTGATTGTTTAGAAATCTGTTAATTTCCACACATTTGTAAA 

TGTTCCAATTTTTCTTTTGTTATTGCCAGCTTCATTCCATTGTGTTCAGAGATGATACAG 

TCAGTGCCTGTTCTTATGAAGCAAACATTCTATAATAGTAGGACCAGTACCCTGTCTGTT 

TCATTCACCACAGTCAGCATGCCCCAAGTGCCCAGCATGGGGCGGATGGCCAGGAATGAG 

TGAAAACTTCCCTTCCTGGGTAGTTGTGACTAGTAGAGAGGAAAAATAATATAATTGCCT 

GCTTACTGCATGCCAGGCATTGGGCTGGGAATTTTTATATTGGATCTAAAATAACTCTTA 

AGTTAGGCATTATCCCCATTTTATAGATGGAGAAACTGGCCCCAAAAGGTGGGAACTTGT 

CCAAGACGTCACAGGTAGCAAGAGGTACTTTTACCTGGCTCCAAATCTGTGTTCTTTCCA 

CTGACAAATGAGATATGGGATATGGTGCATCTTTACAGTACTATAATAAGTATTGGCGTA 

TAACATTATTTTCAAGGAACTCCAAGGGCCACAGGAGCTGACAGGTTTTTCAATTAATAT 

TCCCAACATGAATGAGATGCCTCATTCCTCAGTTTCCTCACGTGTACTATAAGGCTAGTA 

CCTGCTTTGTTGGGGTATGGTTGGCTCGTGTGCATTAAGTCAACAAATCCCTAGT 

SEQ ID NO: 75_AA013524_M 

CTGGTGCAGCAGGGCGCCGAGGCGCGCGTTTTCCGTGGCCGCTTCCAGGGCCGCGCGGCC 
GTGGTGAAGCACCGCTTCCCGAAGAGTTACCGGCACCCGGAGCTGGAGGCGCGGCTCGGC 
CGTCGGCGGACGGTGCAGGAGGCGCGCGCGCTGCTCCGCTGCCGCCGTGCGGGGATAGCT 
GCCCCAGTCGTCTTCTTTGTGGACTATGCGTCTAACTGCTTATATATGGAAGAAATCGAA 
GACTCGGTGACTGTTCGGGATTATATCCAATCCACTATGGAGACTGAAAAGGACCCCCAG 
TGCCTCTTGGACCTGGCCAGGAGGATGGGGCAGGTTCTGGCCGGAATGCACGACCAAGAC 
CTCATTCACGGGGACCTCACCACCTCCAACATGCTCCTGAGGCGGCCCCTGGCGCAGCTG 
CACATCGTGCTCATCGACTTTGGGCTGAGCTTTGTCTCAGGACTGCCGGAAGATAAAGGC 
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FIGURE 2JJJ 

PTrGACCTCTATGTCCTGGAGAAGGCCTTCCTCAGCACGCACCCCCACACCGAGACCGCG 
TTTGAAGCCTTTCTGAAGAGTTACGGGGCCTCGTCCAAGAAGTCCAGTCCAGTGCTGAAG 
AAGTTAGATGAGGTGCGCCTGAGAGGGCGAAAGCGGTCCATGGTCGGGTAGTGGAGCTGT 
GGTGAACTGGCTCACGGTGAAGGATGATGTAGACGAGGCTGGACCCCTCAGCAAAGCATG 
GGTTGTTAAGTGGTCTGTGATCGTGCTGGGCCACCACCATCCATGGCTCACTGTTCTCAG 
rrrrTTCATGTACATGAGGTTTATTCTGGGCAGAACTGGGTAGGTAGCCCAGGCTAGCCT 
TGAATTTATGGCAACATCCTACCTCAGCTTGCTTGGAAGAGGTTATAAGCCACCATACCT 

GACTTTGCACTGATTCTGTCAGAAAC 

SEO ID NO: 76 17000139801197_H, IRAKM_H 

ATGGCGGGGAACTGTGGGGCCCGCGGCGCGCTGTCGGCGCACACGCTGCTGTTCGACCTG 
CCGCCCGCGCTGCTCGGAGAGCTCTGCGCTGTTCTGGACAGCTGCGACGGCGCGCTGGGC 
TGGCGCGGCCTGGCAGAGAGACTTTCAAGCAGCTGGCTGGATGTTCGTCATATTGAAAAG 

SSagaccaaggtaaaagtggaacaagagaattactttggtcctggg 

AAGACCATCGGTGACCTTTTACAGGTCCTCCAGGAGATGGGACATCGTCGAGCTATTCAT 

™ttacaaactatggagcagtgttgagtccttcagagaagagttatcaggaaggtgga 

TTTCCAAATATATTATTCAAGGAAACAGCCAATGTCACCGTGGATAATGTTCTTATTCCT 

gaaStaatgaaaaaggagtactgcttaaatcttccatcagctttcaaaatatcataga^ 

GGAACTAGAAATTTCCACAAAGACTTCCTAATTGGAGAAGGAGAGATTTTTGAGGTATAC 
AGAGTGGAGATTCAAAACCTAACATATGCTGTCAAATTATTTA^ 

CAGTGTA^GAAGCATTCGAAGAGGTTTTTATCTGAGCTTGAAGTTTTACTACTGTTTCAT 

caSwcatactagagttggctgcatattttacagagactgagaagttctgtctgatt 

TATCCATACATGAGAAATGGAACACTTTTTGACAGATTGCAGTGTGTAGGTGACACGGCC 
CCAC^CCTTGGCACATTCGAATCGGTATATTAATAGGAATATCCAAAGCCATTCACTAC 

Sgcacaacgttcaaccatgctcggtcatctgtggcagtatatcaagtgcaaacatcctt 

Sgga^tcagtttcaacccaaactaactgattttgccatggcacacttccggtccc^ 

ctagaacatcagagttgtaccataaatatgaccagcagcagcagtaaacatctgtggtac 

atgccaga^gagtacatcagacaggggaaactttccattaaaacagatgtctacagcttt 

ggSSgtaataatggaagttctaacaggatgtagagtagtgttagatgatccaaaacat 

atccagctg^gg^atctccttagagaattgatggagaagagaggcctggattcatg^ 

?caStctagataagaaagtgcctccctgccctcggaatttctctgccaagctctt^ 

^SgcIggccggtgtg^ 

S?ac?cSgaaagtactcaagccagcttgtattttgctgaagatcc 

a^gtccttcaggtgtccttctcctctattcctggagaatgtaccaagtattccagtgg^ 

g1?gatga!agccagaataacaatttactaccttctgatgaaggcctgaggatagacaga 

Sgactca^aIctccttttgaatgcagccagtc 

aaaaagccag^gagcaagagaaatgaggaagcttgcaacatgcccagttcttctt^ 
gaaagttggttcccaaagtatatagttccatcccaggacttaaggccctataaggtaaat 
a^agatccttcttcagaagctccagggcattcttgcaggagcaggccagtggagagcagc 
tgttcctccaaattttcctgggatgaatatgaacagtacaaaaaagaataa 



qEO ID NO: 77 AA84 0598 M IRAKM M 

atgtggaagagatttttatcagaactggaagttctactcctgttccgtcacccccacata 

cSgagc^ggctgcatatttcacggagactgagaaactttgtctggtttatccc^ 

SScgggacgcttttcgacagattacagtgca^ 

cacot??gaatcagcgtattgataggaatagccaaagccatccaatacttgcac^ 
cagSgtgc^ccgtcatctgtggcaacgtttccagtgcaaa 

act^accaWtatgaccggcggtggcaggaaacatctgtggtacatgccagaagaa 
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FIGURE 2KKK 

^^TACCACCCrGTCCTOKAACTTCTC^ 



OTGRGCTTGTGGCGCCATCCCAGGACTTAAGTCCTALI^" TTC CTCTGGG 

™gtaccaggccattcttat = 



" W ^TTCTTATGGGAGCAAGCCAATGGAGAAG? 

CTCTTTTGCAGTGAGCATGAAC^ 
AAAATAAAAGCAAACGTCACTGAAGG^^ 
AGCTCTGAGCTCCGTGAGTACAGCCAA^ 

TCTGTTGCGTAGTGTGACTTTGTGGCATGACTT^ 



TGCTAGGGATACTGACAGTCTATTTGCTTCC^ 

Sgg?tgtagggaatgtctaatttgtaaatg^ 

TTATATTAAAGAATTCCAGCACT 

SEQ ID NO: 78_AA088547_H _^_ rrrccGGCTGGGG CTCCAGCTCCAGTTC 
TTTCTCTCTGACCCAGCAGATGGCAGCCTGTACAT^ 



TTAATGAAACTGCCATTCACCATCCL ^^ll^"^ GGTG GACCCTGAGTCA 
GATGGGGTCTTCTACACAGGCCGGAAGCAGG^ATGCCTG^ 

3GG GAGACCCAGATGACACTGACCACAGAGG^^ 

CGAACACAGTATACGGTCAGCA^ 



GGCCAWTCCGACTGCCTGC^GGCCCCCGGGAC^ ^^ rKSQCTTCraTGT CTCT 
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FIGURE 2LLL 

GGACACCACGAGCTACCCCCAGTCCTGCACACCKCA AGGCCCC AGCCTTCTTCTTG 

GACGACCCTGAAGCTGAGCAACTCA^ 
GTGCTGGGCCGCGGGGCAGGCGGGACTTTCGTTTT^ 

GTGGCTGTCAAGCGGCTCCTCCGC^ 



Sgttccactacattgccctggagc^ 

CCGGACCTGGATCGCGGGGGTCTGGAG^^ 
^CATCTTCTCTGCAGGCTGCGTGTTCTACTACGTGCTTTCT 

GGGAGGTGA 

qEO ID NO: 79 HGP_6644466 ^„ rT * TrTGTTG GTGCTAGAGGCAGCTGC 
GGAGGGTTCGAATTGCAACGGCAGCTGCCG^GGCG 

Lggtctcgctgggggccgctcggg^ 
tgcttttactgaagccaatgatggcagtctg^^ 

GTCTCTAAATGACTTAA^ 
CATAATTTTAAAAGTTGCTTTGAATATGGC^ 

GAAACTGCTTCATGGAGACATA^GTCTTCA^T^ 

AATTAAAATCTGTGATGTAGGAGTCTCTCTACCAL^ CAAAGAAGCT GTGGAGGAGAA 
C^CTGAGGCTTGTTACATTGGCACAGAGC^ 

^SSSSSSSSSSSSSSSSSSU ^CAAC^C—GCAC 

•5^113 
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FIGURE 2MMM 

TAATGAAGACCCTAAAGATCGTCCTTCTGCTGCACACATTGTTGAAGCTCTGGAAACAGA 

TGTCTAGTGATCATCTCAGCTGAAGTGTGGCTTGCGTAAATAACTGTTTATTCCAAAATA 

TTTACATAGTTACTATCAGTAGTTATTAGACTCTAAAATTGGCATATTTGAGGACCATAG 

TTTCTTGTTAACATATGGATAACTATTTCTAATATGAAATATGCTTATATTGGCTATAAG 

CACTTGGAATTGTACTGGGTTTTCTGTAAAGTTTTAGAAACTAGCTACATAAGTACTTTG 

ATACTGCTCATGCTGACTTAAAACACTAGCAGTAAAACGCTGTAAACTGTAACATTAAAT 

TGAATGACCATTACTTTTATTAATGATCTTTCTTAAATATTCTATATTTTAATGGATCTA 

CTGACATTAGCACTTTGTACAGTACAAAATAAAGTCTACATTTGTTTAAAACACTGAACC 

TTTTGCTGATGTGTTTATCAAATGATAACTGGAAGCTGAGGAGAATATGCCTCAAAAAGA 

GTAGCTCCTTGGATACTTCAGACTCTGGTTACAGATTGTCTTGATCTCTTGGATCTCCTC 

AGATCTTTGGTTTTTGCTTTAATTTATTAAATGTATTTTCCATACTGAGTTTAAAATTTA 

TTAATTTGTACCTTAAGCATTTCCCAGCTGTGTAAAAACAATAAAACTCAAATAGGATGA 

TAAAGAATAAAGGACACTTTGGGTACCAGAAGGTGTCTCAGCATTATTTTATACTTC 

ID NO: 80 AA449542 M 
ATCTCCAAGAGGGTTGTCTCATTCTCCTTGGGCCGTGAAAAAGATAAGTCTTTTATGCGA 

TGATCATTATCGAACTGTGTATCAGAAGAGACTAACTGATGAAGCTAAGATTTTAAAAAA 

CCTTAATCACCCAAACATTATAGGATATCGTGCTTTTACTGAAGCCAGTGATGGTAGTCT 

GTGCCTTGCTATGGAGTATGGAGGTGAAAAGTCTCTGAATGACTTAATAGAAGAGCGGAA 

CAAAGACAGTGGAAGTCCTTTTCCAGCAGCTGTAATTCTCAGAGTTGCTTTGCACATGGC 

CAGAGGGCTAAAGTACCTGCACCAAGAAAAGAAGCTGCTTCATGGAGACATAAAGTCTTC 

AAATGTTGTAATTAAAGGTGATTTTGAAACAATTAAAATCTGTGATGTAGGAGTCTCTCT 

GCCATTGGATGAAAATATGACTGTGACTGATCCTGAGGCCTGTTATATTGGTACTGAGCC 

ATGGAAACCCAAGGAAGCGTTGGAAGAAAATGGCATCATTACTGACAAGGCAGATGTGTT 

TGCTTTTGGCCTTACTCTGTGGGAAATGATGACTTTATGTATTCCACACGTCAATCTTCC 

AGATGATGATGTTGATGAAGATGCAACCTTTGATGAGAGTGACTTCGATGATGAAGCATA 

TTATGCAGCTCTGGGGACAAGGCCATCCATCAACATGGAAGAGCTGGATGACTCCTACCA 

GAAGGCCATTGAACTCTTCTGTGTGTGCACTAATGAGGATCCTAAAGATCGCCCGTCTGC 

TCCACACATCGTTGAAGCTTTGGAACTAGATGGCCAATGTTGTGGTCTAAGCTCAAAGCA 

TTAACTTGTATGGGAACTGTTAACTAGATATATGTAGTTAATATAACTTATGGTAGCTAG 

ATTCTAGAAGTAGCTTTAACACTAGTGACCCCTGTCTAAGATGACTTAAGAATCAAGGGA 

CCATTGCTTTGTTACAGATCTTTTTAGATATTCTTGCTTCTTTAGTGGGTTACTAAAAAT 

TTCACTACGTACATGTGGTACAGATATCTGTCTGCTCATAGTGTCAGTCCTTCAGCTGGC 

CTGTCAGCCCATGCGCCCTGGGACTTGAGAAGAGTTCATAAACGTAGCTCCTAGGGTGTC 

TTGCCTCTCTACACTTAGCTTCTAATTTATTACTTTGTTTCTACTGATTGTGTCTTAAGT 

CTTTTAAAATAAATGTAAGAATAAACAATAAAAGACAGTTTTAGTACCAGG 

SEO ID NO: 81 5R57 10 2 M TESK2_M 

GCTGCTGGACAGTGACTTGTATTTACCGTGGACTGTGAGAGTGAAACTGGCCTATGGCAT 
AGCAGTGGGCCTCAGCTACCTTCACTTCAAAGGCATTTTCCATCGGGACCTCACATCAAA 

GGTGTGAAGGCTTTGCTTTC 
SEO ID NO: 82 AA232253 H 

ATGTCGTCTCTCGGTGCCTCCTTTGTGCAAATTAAATTTGATGACTTGCAGTTTTTTGAA 
AACTGCGGTGGAGGAAGTTTTGGGAGTGTTTATCGAGCCAAATGGATATCACAGGACAAG 
GAGGTGGCTGTAAAGAAGCTCCTCAAAATAGAGAAAGAGGCAGAAATACTCAGTGTCCTC 
AGTCACAGAAACATCATCCAGTTTTATGGAGTAATTCTTGAACCTCCCAACTATGGCATT 
GTCACAGAATATGCTTCTCTGGGATCACTCTATGATTACATTAACAGTAACAGAAGTGAG 
GAGATGGATATGGATCACATTATGACCTGGGCCACTGATGTAGCCAAAGGAATGCATTAT 
TTACATATGGAGGCTCCTGTCAAGGTGATTCACAGAGACCTCAAGTCAAGAAACGTTGTT 
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FIGURE 2NNN 

&TAGCTGCTGATGGAGTATTGAAGATCTGTGACTTTGGTGCCTCTCGGTTCCATAACCAT 
ACAACACACATGTCCTTGGTTGGAACTTTCCCATGGATGGCTCCAGAAGTTATCCAGAGT 
CTCCCTGTGTCAGAAACTTGTGACACATATTCCTATGGTGTGGTTCTCTGGGAGATGCTA 
ACAAGGGAGGTCCCCTTTAAAGGTTTGGAAGGATTACAAGTAGCTTGGCTTGTAGTGGAA 
AAAAACGAGAGATTAACCATTCCAAGCAGTTGCCCCAGAAGTTTTGCTGAACTGTTACAT 

GAGTCCATGTCAAATGAGACGAGCCTTCCTGACAAGTGTAACTCATTCCTACACAACAAG 
GCGGAGTGGAGGTGCGAAATTGAGGCAACTCTTGAGAGGCTAAAGAAACTAGAGCGTGAT 
CTCAGCTTTAAGGAGCAGGAGCTTAAAGAACGAGAAAGACGTTTAAAGATGTGGGAGCAA 
AAGCTGACAGAGCAGTCCAACACCCCGCTGCTGCCTTCCTTTGAGATTGGTGCATGGACG 

Sagacgatgtgtattggtgggttcagcagctcgtcagaaaaggtgactcttcagcagag 

ATGAGTGTATATGCAAGCTTGTTTAAAGAAAACAACATTACAGGGAAGCGGCTGCTGCTG 

Sggaggaagaagacctgaaagacatgggcattgtctccaaggggcatatcattca^ 

AAGTCAGCCATTGAGAAATTAACCCATGATTACATAAATTTGTTTCACTTCCCACCACTA 

a^?aaggactcaggaggtgaacctgaagaaaatgaggaaaaaatagtgaacctgga^ 

GTTCTTGGTTTTCACTTGAAACCAGGAACTGGCCCACAGGATTGTAAGTGGAAAATGTAT 

atcgagatggatggggatgaaattgcaataacctacataaaagatgtgacattcaac^^ 

a^cSacctgatgcggagattttaaagatgacaaagccaccatttgtaatggagaagtc^ 

^Saggaatagcaaaaagtcagactgtggagtgcactgtcacatatgagagtgatgtt 

agSctccaaaaagcactaaacatgtccatttgattcagtggagtagaacaaaacctc^ 

gatcaagtgaaagcagtccaacttgccattcagacattattcaccaattcagatggcaac 

Stc^gcaggtccgactcaagtgctgattgccagtggttagatactctgaggatgcgg 

caga?tccatccaacacttctttacagcgttcccagagcaatcctattctggggtcaccg 

ttcttctcacactttgatggccaggattcctacgctgctgctgtgagacggccccaggtg 

ScStaagtatcaacagattacacctgtgaaccagtccagaagctcgtctcctactcag 

?atc™gaccaaaaacttctcttccttacatctcaactctagggacagtggcttttcc 

Igtcgcaatactgacacctcttcagagaggggtcgatactcagacagaagcaggaacaaa 

SggaotggtLtatatcactcaattcttc 
SgS^ccactccatcaagaggaagataccctggaa^ 

gcactcaatcctcaccagtcgcctgacttcaagagaagccccagggacctccaccaaccc 
SSaccataccagggatgcctttgcaccctgagactgactcaagagccagtgaagaggac 
a^?1aagtcagcgaagggggctggacaaaagtggaataccggaaaaagccccacaggcca 
tct^gccaaaaccaataaagagagagccagaggggaccaccgtggatggagaaacttt 

TGA 

atgggaaattataaatctagacca^cccaaacttgt^ 

Sga^IStgttatcacaatagaaagattagaagatgacctgcagatcaag^ 

gSctcacagaactaaggaatatatttggctctgatgaagccttcagtaaagtcaattta 

aa^taccgcactgaaaatgggctgtctctacttcatttatgttgcatttgtggaggcaag 

^acLattcgaactcttatgttgaaagggctccgcccatctcgactgacaagaa^ 

GGATTTACAGCCTTGCATTTAGCAGTTTACAAGGATAATGCAGAATTGATCACTTCTCTG 

c??Scag?ggLctgatatacagcaggttggatacggtggcctcactgccctcca 

SacS?Igctggccacctagaggctgctgatgtgctgttgcaacatggagctaat^ 

a^Sttcaagatgcagtttttttcactccattgcatattgcagcgtactatggacatgaa 

ga?aStccacctagcatctgc 

gaagaaggcagcaaagcagatgtgaatgctcaagataatgaagaccatgtcccactccat 
?^?gttctcgatttggacaccatgatatagttaagtatctgctgcaaagtgatttggaa 
otcaIJScatgttgttaatatctatggagatacccccttacacctggcatgctacaat 
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FIGURE 2000 

GGCAAATTTGAAGTTGCCAAGGAAATCATCCAAATATCAGGAACAGAAAGTCTGACTAAG 

GAAAACATCTTCAGTGAAACAGCTTTTCATAGTGCTTGTACCTATGGCAAGAGCATTGAC 

CTAGTCAAATTTCTTCTTGATCAGAATGTCATAAACATCAACCACCAAGGAAGGGATGGG 

CACACTGGATTACACTCTGCTTGCTACCACGGTCACATTCGCCTGGTTCAGTTCTTACTG 

GATAATGGAGCTGATATGAATCTAGTGGCTTGTGATCCCAGCAGGTCTAGTGGTGAAAAA 

GATGAGCAGACATGTTTGATGTGGGCTTATGAAAAAGGGCATGATGCCATTGTCACACTC 

CTGAAGCATTATAAGAGACCACAAGATGAATTGCCCTGTAATGAATATTCTCAGCCTGGA 

GGAGATGGCTCCTATGTGTCTGTTCCATCACCCTTGGGGAAGATTAAAAGCATGACAAAA 

GAGAAGGCAGATATTCTCCTCCTAAGAGCTGGATTGCCTTCACATTTCCATCTTCAGCTC 

TCAGAAATTGAGTTCCATGAGATTATTGGCTCAGGTTCTTTTGGGAAAGTATATAAAGGA 

CGATGCAGAAATAAAATAGTGGCTATAAAACGTTATCGAGCCAATACCTACTGCTCCAAG 

TCAGATGTGGATATGTTTTGCCGAGAGGTGTCCATTCTCTGCCAGCTCAATCATCCCTGC 

GTAATTCAGTTTGTGGGTGCTTGCTTGAATGATCCCAGCCAGTTTGCCATTGTCACTCAA 

TACATATCAGGGGGTTCTCTGTTCTCCCTCCTTCATGAGCAGAAGAGGATTCTTGATTTG 

CAGTCTAAATTAATTATTGCAGTAGATGTTGCCAAAGGCATGGAGTACCTTCACAACCTG 

ACACAGCCAATTATACATCGTGACTTGAACAGTCACAATATTCTTCTCTATGAGGATGGG 

CATGCTGTGGTGGCAGATTTTGGAGAATCAAGATTTCTACAGTCTCTGGATGAAGACAAC 

ATGACAAAACAACCTGGGAACCTCCGTTGGATGGCTCCTGAGGTGTTCACGCAGTGCACT 

CGGTACACCATCAAAGCAGATGTCTTCAGCTATGCTCTGTGTCTGTGGGAAATTCTCACT 

GGCGAAATTCCATTCGCTCATCTCAAGCCAGCGGCTGCGGCAGCAGACATGGCTTACCAC 

CACATCAGACCTCCCATTGGCTATTCCATTCCCAAGCCCATATCATCTCTGCTGATACGA 

GGGTGGAACGCATGTCCTGAAGGAAGACCCGAATTTTCTGAAGTTGTCATGAAGTTAGAA 

GAGTGTCTCTGCAACATTGAGCTGATGTCTCCTGCATCAAGTAACAGCAGTGGGTCTCTC 

TCACCTTCTTCTTCTTCTGATTGCCTGGTGAACCGGGGAGGACCTGGCCGGAGTCATGTG 

GCAGCATTAAGAAGTCGTTTCGAATTGGAATATGCTCTAAATGCAAGGTCCTATGCTGCT 

TTGTCCCAAAGTGCTGGACAATATTCCTCTCAAGGTCTGTCTTTGGAGGAGATGAAAAGA 

AGTCTTCAATACACACCCATTGACAAATATGGCTATGTATCCGATCCCATGAGCTCAATG 

CATTTTCATTCTTGCCGAAATAGTAGCAGCTTTGAGGACAGCAGCTGA 

SEQ ID NO: 84_H976 8 5_H 

ATGATTTCTTGCCTGTNATAACCTATGCACTCACAAAGATGAACTCTCTGAGAGGGATGA 
GCAAGAGCTTCAGGAAATCCGAAAGTATTTCTCCTTTCCTGTATTCTTTTTCAAAGTGCC 
GAAACTGGGCTCGGAGATAATAGACTCCTCAACCAGGAGAATGGAGAGCGAAAGATCACC 
GCTTTATCGCCAGCTAATTGACCTGGGCTATCTGAGCAGCAGTCACTGGAACTGTGGGGC 
TCCTGGCCAGGATACTAAAGCTCAGAGCATGTTGGTGGAACAGAGTGAAAAGCTGAGACA 
CTTGAGCACATTTTCTCACCAGGTGTTACAGACTCGCCTGGTGGATGCAGCCAAGGCCCT 
GAACCTGGTGCACTGCCACTGCCTTGACATCTTTATTAACCAGGCATTTGACATGCAGCG 
GGACCTGCAGATCACTCCCAAACGTCTGGAATATACTCGAAAAAAGGAGAATGAGTTGTA 
TGAATCATTGATGAATATTGCCAACCGAAAGCAGGAGGAAATGAAGGATATGATTGTTGA 
GACACTTAATACCATGAAGGAGGAACTTCTGGATGATGCTACTAACATGGAGTTTAAAGA 
CGTCATTGTCCCTGAGAATGGAGAACCAGTAGGCACCAGAGAGATCAAATGCTGCATCCG 
ACAGATCCAGGAACTCATCATCTCCCGACTTAATCAGGCAGTGGCTAATAAGCTGATCAG 
CTCAGTGGATTACCTGAGGGAAAGCTTCGTCGGAACCCTGGAACGATGTCTGCAGAGCCT 
GGAGAAGTCTCAGGATGTCTCAGTTCACATCACCAGTAATTATCTCAAACAGATCTTAAA 
TGCTGCCTATCATGTTGAAGTCACGTTTCACTCAGGGTCGTCAGTTACAAGGATGCTATG 
GGAGCAAATCAAACAGATCATCCAGCGCATCACATGGGTGAGCCCACCTGCCATCACTCT 
GGAATGGAAGAGGAAGGTGGCCCAGGAAGCCATTGAGAGCCTCAGCGCCTCCAAATTGGC 
TAAGAGCATTTGCAGCCAATTCCGGACTCGGCTCAATAGTTCCCACGAGGCTTTTGCAGC 
CTCCTTGCGGCAGCTGGAAGCTGGCCACTCAGGCCGGTTAGAGAAAACGGAAGATCTATG 
GCTGAGGGTTCGGAAAGATCATGCTCCCCGCCTGGCCCGCCTTTCTCTGGAAAGCCGTTC 
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FIGURE 2PPP 

TTTACAGGATGTCTTGCTTCATCGTAAACCTAAACTGGGACAGGAACTGGGCCGGGGCCA 
GTATGGTGTGGTATACCTGTGTGACAACTGGGGAGGACACTTCCCTTGTGCCCTCAT^ATC 
AGTTGTCCCTCCAGATGAGAAGCACTGGAATGATCTGGCTTTGGAATTTCACTATATGAG 
GTCTCTGCCGAAGCATGAGCGATTGGTGGATCTCCATGGTTCAGTCATTGACTACAACTA 
TGGTGGTGGCTCCAGCATTGCTGTGCTCCTCATTATGGAGCGGCTACACCGGGATCTCTA 
CACAGGGCTGAAGGCTGGGCTGACCCTGGAGACACGTTTGCAGATAGCACTAGATGTGGT 
GGAGGGAATCCGCTTCCTGCACAGCCAGGGACTTGTCCATCGTGATATCAAACTGAAAAA 
TGTGCTGCTGGATAAGCAGAACCGTGCCAAGATCACTGACTTAGGATTCTGCAAGCCAGA 
GGCCATGATGTCAGGCAGCATTGTGGGGACACCAATCCATATGGCCCCTGAACTTTTCAC 
AGGGAAGTACGATAATTCCGTGGATGTCTACGCTTTTGGAATTCTTTTCTGGTATATCTG 
CTCAGGCTCTGTCAAGCTCCCTGAGGCATTTGAGAGGTGTGCTAGCAAAGACCATCTCTG 
GAACAATGTGCGGAGGGGGGCTCGCCCAGAACGTCTTCCTGTGTTTGATGAGGAGTGCTG 
GCAGTTGATGGAAGCCTGTTGGGATGGCGACCCCTTGAAGAGGCCTCTCTTGGGCATTGT 
CCAGCCCATGCTCCAGGGCATCATGAATCGGCTCTGCAAGTCCAATTCTGAGCAGCCAAA 
CAGAGGACTAGATGATTCTACTTGAAAGCAAAGACCTTTCTCTTTCACTCTCTAGTTATT 
TCCTTCCCCCTCACCATTTGGCCATGGGGAGAATTTGACATTTATTCACTATAGGACACA 
CTCCCAAGGGAACTGGTGCTTGCTGGGAAACTTGGAACCTTCCCAGGCAGGGATGACTCC 
TGGACAGTGAAGAGTTGAATGACTGAGCATATTCAGCAGCTCACTGAAGCGCCAAGCTAT 
CCCTTTAGCAAAAAAGTGTCTCAGATGTGTAAAAGCTGAGGAATGTGGTGTTCTGGCTTC 
ACAAATGAAAAGGAGGCAGATGTT 

SEQ ID NO: 85_W2 0810_M 

TTGATGTCAACCTGAAGGCTTCTAAAGCGAGTGATGTCTACAGCTTTGGGATCCTCGTGT 
GGGCAGTGCTGGCTGGCAGAGAAGCTGAGTTGGTAGACAAGACTTCACTAATCCGGGAAA 
CAGTGTGTGACAGGCAGAGTCGTCCTCCACTGACAGAGCTGCCTCCAGGTAGCCCTGAGA 
CTCCCGGCTTGGAAAAACTGAAGGAGTTAATGATTCATTGCTGGGGTTCCCAGTCCGAAA 
ACAGGCCATCCTTCCAGGACTGCGAACCAAAAACCAATGAAGTTTACAATCTGGTAAAGG 
ACAAGGTAGATGCTGCTGTCTCCGAGGTAAAGCATTATCTGTCTCAGCACAGAAGCAGCG 
GCAGAAACTTGTCTGCCAGAGAGCCAAGCCAAAGAGGCACAGAAATGGATTGCCCGAGGG 
AAACCATGGTTTCTAAAATGCTGGACCGCCTGCATTTGGAGGAACCCTCCGGACCAGTTC 
CTGGAAAATGTCCTGAGAGGCAAGCACAGGACACATCAGTTGGGCCTGCCACACCAGCAA 
GGACATCTTCTGACCCCGTGGCTGGCACTCCTCAGATTCCACATACTTTACCCTTCAGAG 
GCACAACACCTGGGCCAGTCTTTACTGAGACTCCCGGTCCTCACCCCCAAAGGAATCAGG 
GAGATGGAAGACACGGCACTCCTTGGTATCCCTGGACCCCACCGAATCCAATGACAGGGC 
CACCGGCTCTCGTCTTCAACAACTGTTCTGAAGTGCAGATTGGGAACTACAACTCCTTGG 
TAGCACCACCAAGAACTACTGCCTCAAGTTCGGCCAAGTATGACCAAGCACAGTTCGGCA 
GGGGTAGGGGCTGGCAGCCCTTCCACAAGTAGACTTCAGAGAATCACTGCAAGAGCCTGA 
AGTGTGCCATTCAGCGTGGCAATAAAAAGCACGTTTTAAGCAACCTGGACTGGCTAAGAC 
AGTCCTTGCCACTTCCTGAAGCTCACAACATTCTGTGAGGACAGTTGGACCTACACCCAA 
ACTGACTCTTGACCCATCTCCTTAAAGTCAATAAACATAGCATGTTAACTGTG 

SEQ ID NO: 86_AA744236_H 

ATGGGATCAGAGAACAGTGCTTTAAAGAGCTATACACTGAGAGAACCACCATTTACCTTA 
CCCTCTGGACTTGCTGTTTATCCCGCTGTACTGCAAGATGGCAAATTTGCTTCAGTTTTT 
GTGTATAAGAGAGAAAATGAAGACAAGGTTAATAAAGCTGCCAAGCATTTGAAGACACTT 
CGTCACCCTTGCTTGCTAAGATTTTTATCTTGTACTGTGGAAGCGGATGGCATTCATCTT 
GTCACTGAGCGAGTACAGCCCCTGG71AGTGGCTTTGGAAACATTGTCTTCTGCAGAGGTC 
TGTGCTGGGATCTATGACATATTGCTGGCTCTTATCTTCCTTCATGACAGAGGACACCTA 
ACACACAATT^ATGTCTGTTTATCATCTGTGTTTGTGAGTGAAGATGGACACTGGAAGCTA 
GGAGGAATGGAAACTGTTTGTAAAGTTTCTCAGGCCACACCAGAGTTTCTGAGGAGTATT 
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FIGURE 2QQQ 

GTTTTGCTGGGCCTGCGTGATACTAGC^^ 

CCTGAGGAGCCTGAAAATCAAACTGTCAACATACAG^ 

GATGTCAAGTCCCAGTGCACTACCTTGGAT^^ 
CCCAGCAGCTTAGATACTAAAGTAAACCCAGGAGGTGGAA 

ACCTCAGGGGAGCAGAAGCCTATTCCTGCTTTGCm 

TGGAAATCAAGCTTACCCCAAAAGATTAG^TTGTAC^ 

ATCGAGCCGCCAAAAGTGTCATCACAAGAAAGGCCCCTTAA 

TTAGGAGAGGAATTCACCATTCAAGTAAAAAAGAAGCCA^ 

TGGTTTGCTGATATGATCCCAGAAATTAAGCCTTCT 

CTGAACTGGGAAGATAATAACTGGTGA 

SEQ ID NO: 87 - AI °"^-; TaTrGAGT ^ GCC AGAGTCAGTGGCCAGGCACGM 
ACATTAAG^TTAT^CmA^raG^CCAARTATG TCACTCCTGftAA 
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FIGURE 2RRR 

55S=S5SE==»= 



qEO ID NO: 88 AA278842_H _ r , r , rT( -.pr«T'CAGCCCCGGCTCGGGCGGCCGGAG 
rCGG^CCCTGGGCCCTGCACCGCGGCCGCAAGAAGGCCAC^G 

ScStatLtgtgaagcctggcgcggaaga^ 

IgCGOTCAAAACTCTACGGCACCCCAACAT^ 

Stgcctccacgtcgtgacagaggc^ 
^gLgctggtggcctgaaggagctggagat^ 

SSgtcctgggctt^ 
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FIGURE 2SSS 

^^^^^^^^^^^^ 
ATCAGAGCCACAATAAATTCTATTTCAC 

SEQ ID NO: 89_AA599286 H GCTGCTGGACG ACACGGTGCCGCTG 
ATGGCCTTCATGGAGAAGCCGCCAGCCGGCAAGG^ 

ATTGGTAAGATGGATCGTGAATTGATAGCTGA^^ 

AACGTGATCACAACAAATCATATCTTGTCTAATTGTGAGC^ 

CCAAACAACTATTCGGCAAACTATACTGAGATTGCCTTGC^ 

CGATCAGAGCGAAAGTGGGAGGTGGTGGAACOT^^ 

AAGAAATATTTCTTGATGAAGATTAAAAATCAGC^ 

G^TGAGCTTGGCCCAGACIAAGTATTTGTCAGATAAAGATT^ 

TGAGCGTTGCTAATTAGGATGTTTAACGA^^ 

GCAAAACCAAAAGACCCATTTCTAAAG 

GAACTCCAGGAAATAAAAACATATGGACGGCAAAm^ 

GACAAGGGATTCCCTTATGGGCATC^ 

TGCCGGCTGCTGGACCTTGAGAATTCCTTATl^uu ACTGCTTTGGC CAC 

CGAAGACTGACAAGAGCTCAGTCCCACCAT^^ 

TCTCAGGCTGTGAATGGCATGAGCCGAGGGGCCTTGCT 
AAAGGAACTTTGAGGAAAGCCAAA^ 

TGGCATGCAAAAAAAAAAAAAAAAAAAAA 

SEQ ID NO: 90_AA425725_H GGCAGCGGCG GCAGTAGCAGCAGC 
ATGAGCGCCAGCACGGGCGGTGGTGGGGACAGC^ 

TCACAGGCCTCCTGCGGGCCCGAGTCCTCGGGCTC 

CCTCAGATGCTGCAGGGCCTTCTGGGCTCCG^ACGACG^ 

TACTGCAAGGGCGGCTACCACCCTGTGAAGATCG^G 

GTGGTGCGCAAACTGGGCTGGGGCGACT^ 
CGCAAGCGCTTTGTGGCCCTCAAAGTGGTGAAGAGTG^^^ ^ 
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FIGURE 2TTT 



TGCATGGTGCTGGAGGTGCTGGGCCACCAGCTCCTCAAATGGATCATCAAGTCCAACTAC 
CAGGGCCTGCCCGTGCCCTGCGTGAAGAGCATCGTGAGGCAGGTGCTGCACGGCCTGGAC 
TACCTCCACACCAAGTGCAAGATCATCCACACGGACATCAAGCCCGAGAACATCTTGCTG 
TGTGTGGGGGACGCTTACATCAGGCGCCTGGCTGCCGAGGCCACGGAGTGGCAACAGGCA 
GGGGCGCCGCCCCCCTCCCGCTCCATAGTCAGCACTGCCCCCCAGGAGGTCTTGACCGGT 
AAGCTGTCCAAAAACAAGAGGAAGAAGATGAGGCGCAAACGGAAACAGCAGAAGCGGCTG 
CTGGAGGAGCGGCTGCGGGACCTGCAGAGGCTGGAGGCCATGGAGGCTGCCACCCAGGCT 
GAGGACTCTGGCTTGAGACTAGACGGGGGCAGCGGCTCCACATCCTCTTCAGGCTTCTCC 
GGCTCCCTCTTCTCTCCTGCCTCCTGCTCCATCCTCTCCGGCTCGTCCAATCAGCGAGAG 
ACCGGGGGCCTCCTGTCGCCTAGCACACCATTCGGTGCCTCGAACCTCCTGGTGAACCCC 
CTGGAGCCCCAAAATGCAGATAAGATCAAGATCAAGATCGCAGACCTGGGCAACGCCTGC 
TGGGTGCACAAGCACTTCACGGAAGACATCCAGACTCGGCAGTACCGGGCCGTCGAGGTG 
CTGATCGGCGCCGAATACGGCCCCCCGGCAGACATCTGGAGCACAGCCTGCATGGCCTTC 
GAGCTGGCCACTGGTGACTACCTGTTCGAGCCGCATTCTGGAGAAGACTACAGTCGTGAT 
GAGGACCACATCGCTCACATAGTGGAGCTTCTGGGGGACATCCCCCCAGCCTTCGCCCTC 
TCAGGCCGCTATTCCCGGGAGTTCTTCAACCGGAGAGGAGAGCTGCGGCACATCCACAAT 
CTCAAGCACTGGGGCCTGTACGAGGTACTCATGGAAAAGTACGAGTGGCCCCTAGAGCAG 
GCCACACAGTTCAGCGCCTTTCTGCTGCCCATGATGGAGTACATCCCCGAAAAGCGGGCC 
AGTGCCGCTGACTGCCTCCAGCACCCCTGGCTCAACCCCTAG 

SEQ ID NO: 91_SGK022_H 

TCTGGCCCTGTCCCTCCCCACCACCCGCCGCTGTGTCCAGACAGAGAATGTTCTAACGCT 
GGGGGCGGCTGCGGATGAAGTCCTTGGGGAGAAAAGGAGCAGGCCAAGGGCGATGGTGGA 
GTAGAGCTGCCTCTCAGAGGCAGCATGAGCTGAGAGGGTGATAGGAAGGCGGCGCTAGAC 
AGCATGGAGGACTTTCTGCTCTCCAATGGGTACCAGCTGGGC7VAGACCATTGGGGAAGGG 
ACCTACTCAAAAGTCAAAGAAGCATTTTCCAAAAAACACCAAAGAT^AAGTGGCAATTAAA 
GTTATAGACAAGATGGGAGGGCCATCAGAGTTTATCCAGAGATTCCTCCCTCGGGAGCTC 
CAAATCGTCCGTACCCTGGACCACAAGAACATCATCCAGGTGTATGAGATGCTGGAGTCT 
GCCGACGGGAAAATCTGCCTGGTGATGGAGCTCGCTGAGGGAGGGGATGTCTTTGACTGC 
GTGCTGAATGGGGGGCCACTGCCTGAAAGCCGGGCCAAGGCCCTCTTCCGTCAGATGGTT 
GAGGCCATCCGCTACTGCCATGGCTGTGGTGTGGCCCACCGGGACCTCAAATGTGAGAAC 
GCCTTGTTGCAGGGCTTCAACCTGAAGCTGACTGACTTTGGCTTTGCCAAGGTGTTGCCC 
AAGTCACACCGGGAGCTGAGCCAGACCTTCTGCGGCAGTACAGCCTATGCTGCCCCCGAG 
GTGCTGCAGGGCATTCCCCACGATAGCAAAAAAGGTGATGTCTGGAGCATGGGTGTGGTC 
CTGTATGTCATGCTCTGTGCCAGCCTACCTTTTGACGACACAGACATCCCCAAGATGCTG 
TGGCAGCAGCAGAAGGGGGTGTCCTTCCCCACTCATCTGAGCATCTCGGCCGATTGCCAG 
GACCTGCTCAAGAGGCTCCTGGAACCCGATATGATCCTCCGGCCTTCAATTGAAGAAGTT 
AGTTGGCATCCATGGCTAGCAAGCACTTGATAAAAGCAATGGCAAGTGCTCTCCAATAAA 
GTAGGGGGAGAAAGCAAA 

SEQ ID NO: 92_AA0 6002 6_M SGK022_M 

CAGACGGAGAATGTTCTAGCCCTGGAGGCAGCTGTGAATGAAGTCCTTGGGGGGAAAAGA 
AGCAGGCCGAGGGCGATGGTGGAGTAGAGCTGCCTCGCAGAGGCAGCATGAGCTGAGAGG 
GTGACAAGAAGGAGGCGCTACACAGCATGGAGGACTTTCTACTCTCCAATGGGTATCAGC 
TGGGCAAGACCATTGGGG7^AGGGACCTACTCAAAAGTCAAAGAAGCATTTTCCAAAAAAC 
ATCAAAGAAAAGTGGCAATTAAAATTATAGACAAGATGGGAGGGCCAGAAGAGTTTATCC 
AGAGATTCCTGCCTCGTGAGCTCCAGATTGTCCGTACCCTGGACCACAAAAACATCATCC 
AGGTGTATGAGATGCTGGAGTCAGCAGATGGAAAAATCTACCTGGTGATGGAACTGGCTG 
AGGGAGGGGATGTCTTTGACTGTGTGCTGAACGGAGGGCCACTTCCCGAGAGCCGGGCCA 
AGGCCCTCTTCCGCCAGATGGTTGAGGCTATTCGCTATTGCCATGGCTGTGGCGTGGCCC 
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FIGURE 2UUU 

ACCGGGACCTTAAGTGTGAGAACGCCTTGTTGCAGGGCTTCAACCTGAAGCTGACCGACT 
TTGGCTTTGCCAAGGTGCTACCCAAGTCACGCAGGGAGCTGAGCCAGACCTTCTGTGGCA 
GCACAGCCTATGCCGCCCCTGAGGTGCTACAGGGCATACCCCATGATAGCAAGAAAGGTG 
ATGTCTGGAGCATGGGTGTGGTCCTGTATGTAATGCTCTGTGCAAGTCTACCTTTTGATG 
ACACAGATATCCCCAAGATGCTGTGGCAGCAGCAGAAGGGGGTGTCCTTCCCCACTCATT 
TGGGCATCTCAACCGAATGCCAGGACCTGCTCAAGCGGCTCCTGGAACCAGACATGATAC 
TCCGGCCTTCAATCGAAGAAGTTAGTTGGCACCCATGGCTAGCAAGCACTTGATAAAAGC 
AATGGCAAGTCCTCCCCAATAAAGTAGGGGGAGAAAGCAAACTG 

SEQ ID NO: 93 AA3 9 966 9_H 

CTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCGCCCGGCCGCACTTCATTCTCAA 

GTTTTGTGGCCAACGATGGATAGGAGGTGGATTGTGATGTATTCGGAACATGGGACCTTG 

AGGAGTTCCGTAACCAAAAGGAGAAAGTAACAACAGCCAGTGGAGACAAAAAGAACTGCT 

TCTCTTTCTTTCCCCCTCCAAGTTCCTAGTGGAGGGCTGAGTCCAGCATCCCAGACTCGT 

GTGACTATATAGGCAAGCATTTGGGGACCTACTTCACTTTGATACCCTAGCCTTCAGCAG 

CTCAAGGTGTTGGCCTTTGGATAGGAGGCTTCCAAGTAGTAAAGCTCCCTGCTCTCAGCA 

AGCCCAACACCATGGGGAAGGGAGATGTCTTAGAGGCAGCACCAACCACCACAGCCTACC 

ATTCCCTCATGGATGAATATGGTTATGAGGTGGGCAAGGCCATTGGCCATGGCTCCTATG 

GGTCGGTATATGAGGCTTTCTACACAAAGCAGAAGGTTATGGTGGCAGTCAAGATCATCT 

CAAAGAAGAAGGCCTCTGATGACTATCTTAACAAGTTCCTGCCCCGTGAAATACAGGTAA 

TGAAAGTCTTGCGGCACAAGTACCTCATCAACTTCTATCGGGCCATTGAGAGCACATCTC 

GAGTATACATCATTCTGGAACTGGCTCAGGGTGGTGATGTCCTTGAATGGATCCAGCGCT 

ACGGGGCCTGCTCTGAGCCCCTTGCTGGCAAGTGGTTCTCCCAGCTGACCCTGGGCATTG 

CCTACCTGCACAGCAAGAGCATCGTGCACCGGGACTTAAAGTTGGAGAACCTGTTGCTGG 

ACAAGTGGGAGAATGTGAAGATATCAGACTTTGGCTTTGCCAAGATGGTGCCTTCTAACC 

AGCCTGTGGGTTGTAGCCCTKCTTACCGCCAAGTGAACTGCTTTTCCCACCTCAGCCAGA 

CTTACTGTGGCAGCTTTGCTTACGCTTGCCCAGAGATCTTACGAGGCTTGCCCTACAACC 

CTTTCCTGTCTGACACCTGGAGCATGGGCGTCATCCTTTACACTCTAGTGGTCGCCCATC 

TGCCCTTTGATGACACCAATCTCAAAAAGCTGCTAAGAGAGACTCAGAAGGAGGTCACTT 

TCCCAGCTAACCATACCATCTCCCAGGAGTGCAAGGTCCAACTGCTCATTGCCTGTGTGG 

CACAATGGAGAAAAACTCAGGCAAGACCTCTCTCTCCCCTGCTCTAGAACCTGATCCTCC 

AGATGCTACGCCAAGCCACTAAGCGTGCCACCATTCTGGACATCATCAAGGATTCCTGGG 

TGCTCAAGTTCCAGCCTGAGCAACCCACCCATGAGATCAGGCTGCTTGAGGCCATGTGCC 

AGCTCCACAACACCACTAAACAGCACCAATCCTTGCAAATTACGACCTGAAAATGGCTGA 

GGGAGGGGGCTAAGAGAGGAGCAAAGCAGGAGGTCTTGGGCTAAAAATCTTTTTTACCAA 

AAATAAATCTAAGTCTGATTTAGTTTCATCAAAAAAA 
SEQ ID NO: 94 AA758539_H 

GACCATTCAGACGCCTCCGGTAGTGTAAATGAGGACAATGCCTGCTGGCCCACATGACGG 
GGGGATGTAGACGGCAGCGGCGCCAGTCGCTCCTGGCACCATGGACGATGCCACAGTCCT 
AAGGAAGAAGGGTTACATCGTAGGCATCAATCTTGGCAAGGGTTCCTACGCAAAAGTCAA 
ATCTGCCTACTCTGAGCGCCTCAAGTTCAATGTGGCTGTCAAGATCATCGACCGCAGGAA 
AACACCTACTGACTTTGTGGAGAGATTCCTTCCTCGGGAGATGGACATCCTGGCAACTGT 
CAACCACGGCTCCATCATCAAGACTTACGAGATCTTTGAGACCTCTGACGGACGGATCTA 
CATCATCATGGAGCTTGGCGTCCAGGGCGACCTCCTCGAGTTCATCAAGTGCCAGGGAGC 
CCTGCATGAGGACGTGGCACGCAAGATGTTCCGACAGCTCTCCTCCGCCGTCAAGTACTG 
CCACGACCTGGACATCGTCCACCGGGACCTCAAGTGCGAGAACCTTCTCCTCGACAAGGA 
CTTCAACATCAAGCTGTCTGACTTTGGCTTCTCCAAGCGCTGCCTGCGGGACAGCAATGG 
GCGCATCATCCTCAGCAAGACCTTCTGCGGGTCGGCAGCATATGCAGCCCCCGAGGTGCT 
GCAGAGCATCCCCTACCAGCCCAAGGTGTATGACATCTGGAGCCTGGGCGTGATCCTGTA 
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FIGURE 2VVV 

CATCATGGTCTGCGGCTCCATGCCCTATGACGACTCCGACATCAGGAAGATGCTGCGTAT 
rCAGAAGGAGCACCGTGTGGACTTCCCGCGCTCCAAGAACCTGACCTGCGAGTGCAAGGA 

cct^SacScLgctgcagcccgacgtcag^ 

CAGCCACTCGTGGCTGCAGCCCCCCAAGCCCAAAGCCACGTCTTCTGCCTCCTTCAAGAG ^ 

Sgagggggagggcaagtaccgcgctgagtgcaaactggacaccaagaca 

CGACCA^CGGCCCGACC^ 

SagaacgagScaggatggaggacagg 

CATCTCCGGAGCTGAGGTGGGGAAAGCAAGCACCTAGCATGACAATGGCCCCGTTGTGTG 

tcgtcggLtcggggttggggggcatggtgcagtcggccttcacgtaaactaagtaggc^ 
gg?Igga?Sgaagaaggcacaggtgcaagtaa^^ 

TT 

aagg?gSggaccggcggcgagcgcccccggacttcgtcaacaag^ 
ctg?^cSgcggLcgtgcgacaccc 

Sgtc^gggaaactgtacatcgtgatggaagcggccgccaccgacct^ 
Sgcagc^caacgggcgcatccccggagttcaggcgcgcgacctctttgcgcagatcgcc 

ggcgc?g?gc^c?acctgcacgatcatcacc^^ 
SgctoSgagcccggacgagcgc 

GCcS?GG??AC^ 

rAGGTGCTCCTGGGCATCCCCTACGACCCCAAGAAGTACGATGTGTGGAGCATGGGCGTC 
rcCCGGCGCCAGAAACGCGGCGTGCTCTATCCCGAAGGCCTCGAGCTGTCCGAGCGCTGC 

gtagcgcgcaactgctggctgcgcgccggggactccggctag 
cIggtLagaa^ 

Sa?Saa^g1attgtggggtcaggggacagtggcgggaggagctggctcaccaccctgt 
^^a^aggccttataatttgtgattctgtggctttgtctaaa^ 

Stga^?ccagS^ 

™Stgatatactaacgacaaaaataatagaagtgaaca^ 

cagctagacacacttaagaccattaagaaagccaagaaat^ 
a^g??gga^ggcaggagacaggtgtgaggaggtgggcctttctgatctgccagcccat 

cg?ccgSccSggLcacaagaacatcatccag^ 

cgggaaStctgcctggtgatggagctcgctgagggaggggatgtctt^^^ 

Sa??g^gggccactgcctgaaagccgggccaaggccctcttccgtcagatggttgaggc 

ca^ccgSacScI?Sgct 

rTTrrAGGGCTTCAACCTGAAGCTGACTGACTTTGGCTTTGCCAAGGTGTTGCCCAAGTC 
arAPCGGGAGCTGAGCCAG^ 

gSgggcat?ccc^^ 

?SgagSStcggccgattgccaggacctgctcaagaggctcc 

cSSgScttcaattgaagaagttagttggcatccatggctagc^ 

gcaatggcaagtgctctccaataaagtagggggagaaagcaaaccc 
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FIGURE 2 WWW 

GAAGCCAGACAACATCCTCATCACA^G^ 
CATGGCTCCTGAAGTCTGGGAGGGACACTACACAGCCAAGGC^ 

AGAAAACCCAAAGATGGAGTTGCACATCCCCCAAAAACGCAGGA^ 
GATCAAGCAGCTCTTGAAAGATATGT^ 

^ccttttcttttctttttttntttnccacn™ 

ACTTCAGAAGAGCAGGACACAATGCTGTGGACAG^ 

TGTGGGCAAGGCATATGTG^^TCACTTTTACJi^ TACTTTTC TCAGTTCTAC 
GAGGTGAAGAACCGCC 

SEQ ID NO: 9 8 -^ 49 ^^"'^C^GGAAACGGGGAAAGAGAACGGAGCCCGC 

AAGCTGAGGCCGGCGGCCCAGGCCATGGATCCGGCTGG^G 

TTCCTGGCCCGGCGGCGGCCGGATGGCG^ 
AGCCTCTTGGCGGAGATCGGGCGCGGCAGCTACGGC^ 

CGCAGTGGGGCCAGGGTGGCAGTCAAGA^ 
TTGGCACTAGCAGAATTCTGGGCCCTCACCAGTCTCAAG^ 

CAGTTTGAGGAGTGCGTCCTACAGCGC^ 

AAGAACTCACAGCTTTACCTGCGCCTGGTGGAGACC^ 

GGCTATGCTGAGGAGCCCTGCTATCTCTGGTTTGTCATG^ 

CTCAATCAGTATGTCCTGTCCCGGAGA^ 

GCTCCCGAAGTCTGGGAGGGACACTATACAGCCAAG^^^^^^^^^^^^^^^^^^ 
SSSS™— — — A^CCCTGXTGGTGAGGCGCTGC.AGAA 
AACCCAAAGATGGAGTT 
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AAGCAGCTCTTGAAAGACATGTTAGCTGCTAACCCACAGGACCGACCTGATGCTTTTGAA 
CTTGAAACCCGAATGGACCAGGTCACATGTGCTGCTTAAACTCCAGGGCTGAACGTCTTG 
GGTGTTTTTAAACTAGGTCGATCCTTCGGGACCCACAGTCTCATCGTGTCTCGGACAGGA 
TGGCAGAGGGTACAGGTGGTGGTGATCTCCTGACAGCTGGACCTCCCACAATGTGAAGCT 
CACGCTTGGGCTGCCCACTCTACCCTTCTCTTTCTCCTTCAGTAGAATAATAATTGTTTT 
TCTAAACATTAAACCATCAAGACTTCTGAAGAGCAGAAGGCTACACTCTG 

SEQ ID NO: 99_AA215311_H 

CGRCCGCGCTACGGAAAGCCGGAGGGGGGCGGGGCCGTCGGCGTAAGGGGGTGTGTCCGC 

GCGCACCACGGGGGCGCGCGCCGGCTGCTGACTGGAGGCGGCGGCAGCGGAGGCGCGAGC 

TGCCCGATAATGGCGGCCTGCAGAGCCCATGAGAGGGAGAAGCGGCAGCGTCTACCCTGA 

GAAACCTCGACCTTGAAGATGGTGAGTAGCCAGCCAAAGTACGATCTAATACGGGAGGTA 

GGCCGAGGTAGTTACGGTGTTGTGTATGAAGCAGTCATCAGAAAGACCTCTGCACGGGTG 

GCAGTGAAGAAAATTCGATGTCACGCACCTGAAAATGTTGAACTAGCCCTTCGTGAGTTC 

TGGGCACTAAGCAGTATCAAGAGCCAACATCCAAATGTGATTCACTTGGAGGAATGCATC 

CTACAAAAGGATGGGATGGTGCAAAAGATGTCCCACGGCTCTAATTCTTCCCTTTATTTA 

CAGCTTGTAGAAACTTCATTAAAAGGAGAAATTGCCTTTGATCCCAGAAGCGCCTATTAT 

TTGTGGTTTGTGATGGATTTTTGTGACGGAGGAGATATGAATGAGTATCTGTTGTCCAGG 

AAGCCCAATCGTAAAACTAACACCAGCTTCATGCTTCAGCTGAGCAGTGCCCTGGCTTTC 

TTGCATAAAAACCAGATCATCCACCGAGATCTTAAGCCTGATAACATCCTGATTTCTCAA 

ACCAGGTTGGATACCAGTGACTTGGAACCTACCCTCAAAGTGGCTGATTTTGGTCTAAGT 

AAAGTTTGTTCAGCCTCTGGGCAGAACCCAGAAGAACCTGTCAGTGTAAACAAGTGTTTC 

CTTTCCACAGCATGTGGAACAGATTTTTACATGGCTCCTGAAGTTTGGGAAGGACATTAC 

ACAGCAAAAGCTGACATCTTTGCTCTGGGGATTATCATCTGGGCAATGCTGGAAAGGATC 

ACATTCATAGACACAGAGACAAAGAAGGAACTCTTGGGGAGTTATGTAAAACAAGGAACT 

GAGATTGTGCCTGTTGGGGAGGCACTTCTGGAAAATCCCAAAATGGAACTTCTCATTCCT 

GTGAAGAAAAAATCTATGAATGGGCGAATGAAACAACTGATTAAGGAAATGCTGGCTGCA 

AACCCTCAGGATCGTCCAGATGCTTTTGAACTAGAACTCAGATTAGTACAAATTGCATTT 

AAAGATAGCAGCTGGGAAACGTGACACATATTATTTGCAAATACCATGGATGATATGCTG 

CTTCTGTTTAACAGTGATGCAACATTATGTGGCTGAAAAAGAATATAAAAAGCTAGACTC. 

TACCCTCTAAGGGTTTAGATTTTTTGTGGGATTTTTTTTTTCCTCATTTTTCTTAAATCC 

AAGTTGGCCGTTTTATTAGTATGTTTCAAATGTGTATTACCAATGTGGGTGTAAATTTTT 

AAAAAATGATTATTGATAGAAGTTTGGCAGGAAAATTCTTTAAGAGCTAACAAGAGAAGA 

GAGTCCAGTTTTCTGGAAATATGTCTTTAAGTATTTTAGACATTCCTCGTCAGTATTAGG 

AATTTCCATGGGAAAAGAGGTTTGCATGCTGGTAATGCAACCTTTGAAACTTTGTAAAGG 

AAACATATATGTATATATTTATGTATATGTAAGTATGTGAATGTGCGCATTTTGCATTCC 

ATATGAAAAAAATGCCACGTCTGTTTAAATTATTTGATGTAGGTTTGGGTTTTTGAGATT 

TGCTGGTGAAGTCAGTGACGAAAAATAAACCTTCCCTTATCTTCCTACTCTGCCCCTCCC 

CCTAATGAAATCATATTAAGTNGTTTTTCCTNNTTTTTTTGTAATATACAGCTTTTTTTT 

TAAGGCATCATTTTCGAGGGTCTAAAATTATCTGGTAAAACAAATGAAATTAAGTGATCC 

AAAGCTGCTGAAGTATGTTTGAACTCTCCAGTGCCCTATAGCTGCAAGAGTTGAATTAGT 

CATGCAGTCATATGGCAGCAGGTTGGTGATT 

SEQ ID NO: 100_AA018361_H 

GCGGGGCCTCCGTATCCCCACGTGGGCCCTGCAGGAACTGGCGGGGCGCGTGACCCGGCG 
AGGCCCAGAGACAGGGGAGGGGCGCCGGGAGCCGGGCGGATCCGCGTCCCCGATGCGCGC 
TGCATTTCCGGCGGGCGGCGCTGGGGGCAGCGTGGAGCCACCCAGTGCTCGGCCCGCCCC 
GCAACCCGCCGGAACCGCCGCCCGCAGCGAGGAAGCGCCCGCGCGGGCGCAGGCGGCCGG 
AATGGCGGGGCCCGGCTGGGGTCCCCCGCGCCTGGACGGCTTCATCCTCACCGAGCGCCT 
GGGCAGCGGCACGTACGCCACGGTGTACAAGGCCTACGCCAAGAAGGACACTCGTGAAGT 
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FIGURE 2YYY 

GGTAGCCATAAAGTGTGTAGCCAAGAAAAGTCTGAACAAGGCATCGGTGGAGAACCTCCT 
CACGGAGATTGAGATCCTCAAGGGCATTCGACATCCCCACATTGTGCAGCTGAAAGACTT 
TCAGTGGGACAGTGACAATATCTACCTCATCATGGAGTTTTGCGCAGGGGGCGACCTGTC 
TCGCTTCATCCATACCCGCAGGATTCTGCCTGAGAAGGTGGCGCGTGTCTTCATGCAGCA 
ATTAGCTAGCGCCCTGCAATTCCTGCATGAACGGAATATCTCTCACCTGGATCTGAAGCC 
ACAGAACATTCTACTGAGCTCCTTGGAGAAGCCCCACCTAAAACTGGCAGACTTTGGTTT 
CGCACAACACATGTCCCCGTGGGATGAGAAGCACGTGCTCCGTGGCTCCCCCCTCTACAT 
GGCCCCCGAGATGGTGTGCCAGCGGCAGTATGACGCCCGCGTGGACCTCTGGTCCATGGG 
GGTCATCCTGTATGAAGCCCTCTTCGGGCAGCCCCCCTTTGCCTCCAGGTCGTTCTCGGA 
GCTGGAAGAGAAGATCCGTAGCAACCGGGTCATCGAGCTCCCCTTGCGGCCCCTGCTCTC 
CCGAGACTGCCGGGACCTACTGCAGCGGCTCCTGGAGCGGGACCCCAGCCGTCGCATCTC 
CTTCCAGGACTTCTTTGCGCACCCCTGGGTGGACCTGGAGCACATGCCCAGTGGGGAGAG 
TCTGGGGCGAGCAACCGCCCTGGTGGTGCAGGCTGTGAAGAAAGACCAGGAGGGGGATTC 
AGCAGCCGCCTTATCACTCTACTGCAAGGCTCTGGACTTCTTTGTACCTGCCCTGCACTA 
TGAAGTGGATGCCCAGCGGAAGGAGGCAATTAAGGCAAAGGTGGGGCAGTACGTGTCCCG 
GGCTGAGGAGCTCAAGGCCATCGTCTCCTCTTCCAATCAGGCCCTGCTGAGGCAGGGGAC 
CTCTGCCCGAGACCTGCTCAGAGAGATGGCCCGGGACAAGCCACGCCTCCTAGCTGCCCT 
GGAAGTGGCTTCAGCTGCCATGGCCAAGGAGGAGGCCGCCGGCGGGGAGCAGGATGCCCT 
GGACCTGTACCAGCACAGCCTGGGGGAGCTACTGCTGTTGCTGCGGAGCCCCCGGGCCGG 
AGGCGGGAGCTGCTTCACACTGAGGTTCAGAACCTCATGGCCCGAGCTGAATACTTGAAG 
GAGCAGATGAGGGAATCTCGCTGGGAAGCTGACACCCTGGACAAAGAGGGACTGTCGGAA 
TCTGTTCGTAGCTCTTGCACCCTTCAGTGACCCTAGAAGAATGATTGGACAGATGTGAGC 
CATCTGGAGCAGAGGGGCACTAACCCAGGCTGACGCCAAGAATGAAGTGGCCCACTGCAG 
CCCTGGCGAGCAGGCTTCTTGGATGGACAGTGCTGAGACCCCCATATCCCAGAGTCCCCA 
GCCTCCCTCAGGTTACTCTGCACCCCACAGATGGTTTGATGGCTGTGCTGTATACTGGAG 
GGGAGGGCAGGACTCTGGGAGAACAGCACTTCTTTCATGAGACCTTTGTTACTCGGTGGT 
TACTGGGTCCTGTGCCTGTCCGTTTTGGGGCATGCAGCCCTCTATCATTTTTGGCTCCGA 
GAAGAGGGCAAGGGGCCCCCGCAGGGTACTTCTGTGCTTGCCCTCGCCCTGCCAGCAGGC 
AGCTGTGCCCCTGGCCTGCCCTTCCCGGGACCCCTTATTCCAACTCAGCTCCTCTTTGCA 
CTGGAATGGGGCACTCCAACACCCCTCAGGGACCACCCTCCCCACAGTATGCACTCAGCC 
CCACAGAACCCACCAGTCTTTCTGGGAACTCACACCTGCCCGCCATCTTGGTACTTTAGG 
TTAATCCCTCAAGCATGAAAGCTGGATCTTTTGGGGTTTAAG/^AGCCCAAGCCTTGTTCC 
TGCCCTGGCCTAGGGAGCACTCAGGAGGGTTCCTTGGTCCTCATCTCTCCCACCTCCGTT 
CCCTCTGGGCCCCACACTAGCCACAGCGCGGGCCTTGTGCTGGAGTTTGAGCCTGGGACA 
GGGAGAGGGAGGCTTGGAGACAGTCTGACCCAGTGCCCTCTAGGCCACCCACTTCTAGGC 
CTGCCCTGCCGCCGTGGAGCCCTGGGCAAGCTCTTTCCCCTTTCTGGGCCTGGGTCTCCC 
CATCTCTTCAATGGGGCTGATACCTTCACAGCCCACAGCATGGGCACTTATGAGGACAAA 
GTGAATTTAACCTGGAAAAGAATGTATTTGAGAGTTTCTTTTAAATAATCAGCGGGTGTT 
GGTGATTTGTAGCCCTTCTGCCCTTAAATGCTTCCTTGGGCAAGAGCTGTCTGTCCTCCC 
TGCAGGAGGCTGAGTGTGAAGAGTATCATTCATTGTTTCTCTATTAAATTATTTTCTCT 

SEQ ID NO: 101_AA3 11714_H 

TGGACCTGTCCTGAGGCAGAGGCCGAGATGCGCGCAACCGCGGGAGCAGCCAAGTGGACT 
GGACTCTTTTCTTGACTTAGCTACCAGGAGCTAGAGATGCTGTTATTCTATCGTATGTGA 
GAAGTCGGCCCAGAGATGGAAAACTTTATTCTGTATGAGGAGATCGGAAGAGGAAGCAAG 
ACTGTTGTCTATAAAGGGCGACGGAAGGGAACAATCAATTTTGTAGCCATTCTTTGTACT 
GATAAGTGCAGAAGGCCTGAAATAACCAACTGGGTCCGTCTCACCCGTGAAATAAAACAC 
AAGAATATTGTAACTTTTCATGAATGGTATGAAACAAGCAACCACCTCTGGCTAGTGXAT 
GAAAACCTCCCAGAAGATGTTGTGAGAGAATTTGGAATTGACCTGATTAGTGGATTACAT 
CATCTTCATAAACTTGGCATTCTCTTTTGTGACATTTCTCCTAGGAAGATACTCTTGGAA 



1-F/H3 
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GGGCCTGGCACACTGAAGTTTAGCAACTTTTGCTTGGCAAAAGTGGAAGGTGAAAATTTG 
GAAGAGTTCTTTGCTTTGGTGGCAGCAGAGGAAGGAGGAGGTGATAATGGGGAAAATGTC 
CTGAAGAAAAGCATGAATVAGTAGAGTCAAAGGATCTCCTGTATATACAGCACCAGAAGTT 
GTGAGGGGTGCTGACTTTTCCATCTCCAGTGACCTCTGGTCTTTGGGCTGTCTGCTTTAT 
GAAATGTTTTCAGGAAAACCTCCATTCTTCTCAGAAAGTGTTTCAGAATTAACTGAAAAG 
ATCTTATGTGAAGATCCTTTGCCACCTATTCCGAAAGATTCTTCTCGTCCTAAAGCTTCT 
TCAGATTTTATTAATTTGCTTGATGGGTTACTTCAAAGAGATCCTCAGAAAAGATTGACT 
TGGACAAGGCTACTGCAGCATTCATTTTGGAAGAAAGCTTTTGCTGGAGCAGATCAGGAA 
TCAAGCGTCGAAGATCTCAGTCTCAGCAGAAACACTATGGAGTGTTCTGGGCCACAAGAT 
TCCAAGGAGCTTTTGCAGAACTCTCAGAGTAGACAAGCAAAAGGGCACAAGAGTGGTCAA 
CCACTAGGTCACTCTTTCAGACTAGAAAATCCAACTGAGTTTCGGCCTAAGAGTACTCTT 
GAGGGTCAATTGAATGAATCCATGTTTCTTCTCAGTTCTCGTCCTACTCCCAGAACTAGC 
ACTGCAGTGGAAGTAAGTCCTGGTGAGGATATGACTCACTGTTCACCACAGAAGACTTCT 
CCTCTGACCAAGATTACAAGTGGACACCTGAGTCAGCAGGACCTGGAATCCCAGATGAGA 
GAGCTTATCTACACGGACTCAGATCTTGTTGTCACCCCCATTATCGACAATCCAAAGATA 
ATGAAACAGCCACCAGTTAAATTTGATGCAAAAATATTGCATCTACCAACATATTCAGTG 
GATAAGTTATTATTTCTGAAAGATCAAGATTGGAATGACTTTTTGCAACAAGTGTGCTCG 
CAGATCGACTCCACTGAGAAGAGCATGGGGGCCTCCCGAGCCAAGCTGAATCTCCTTTGC 
TATTTGTGCGTGGTGGCTGGTCACCAGGAGGTGGCCACCAGGCTCCTCCATTCCCCCCTG 
TTCCAATTGCTAATCCAGCATTTGCGGATAGCTCCAAACTGGGATATACGGGCCAAGGTT 
GCTCACGTGATTGGTTTACTGGCTTCGCACACAACTGAGCTCCAGGAAAATACACCTGTT 
GTTGAGACTACAAGCTCCATTGGAATCGGGATTTTGAACTGTCTTGTTCAACACTCCACT 
CCAGTGCCTAGACAGTGCCTTGTGTATGTATAGATACTGACAAATATTTCAAAATAAATA 
AAACTGTATCAGCATT 

SEQ ID NO: 102_SGK3 84_H 

TCTTTGGCCCACGTGCTGAGGGCGCGGCAGATCCTGACGGAGCCAGAAGTGCGCGACTAC 
CTGCGGGGCCTGGTCAGCGGCCTGCGCTACCTGCACCAGCGGTGCATCCTGCACCGC 

SEQ ID NO: 1 03_AA2 1 04 51_M SGK384_M 

GGTCTGCTGCATGGATAATGGACTGGAACACAGAAAGACCATGCAGGGTTCGGCTGTAGA 
AGGCCAGTATCTCCAGAGGCCAGAAGACACCATCAGATCTCCTGGGACTGGAGTTATAGA 
GGTTGTGAGCTGCCATGTTGAACCAAGCAGGTCACTGAGGGACACAGGCATGTGGATGGA 
AACCCTGCTGGGAGAAAAAAGAAACTGCTGAAGGGACTGACATGGGACAGCAACATGGAA 
CCAGGAATGGTCTCACGCATAGAGAGCTCCCCCGGGGCGTGGGGCTGCTGCTCGCCATGG 
CCCTTATGAACGTGGCGCTCTACCTCTGCCTTGATCAGCTTTTCATCTCCCCTGGACGAT 
CCACCGCGGACTCTAGGCGCTGTCCTCCGGGCTACTTCAGAATGGGGCGGATGAGAAACT 
GCTCACGCTGGCTGTCCTGTGAAGAGCTGAGGACAGAAGTCAGGCAGCTGAAGCGCGTTG 
GGGAGGGAGCCGTGAAGAGAGTCTTTCTGTCTGAATGGAAGGAACACAAAGTCGCTCTCT 
CCCGGCTCACCAGGCTGGAGATGAAGGAGGACTTCCTGCATGGGCTGCAGATGCTG7^AGT 
CTCTACAGAGTGAGCACGTGGTCACGCTGGTGGGCTACTGTGAGGAAGATGGCACTATTC 
TCACCGAATATCACCCCTTAGGTTCCTTGAGCAACCTGGAAGAAACACTAAACCTTTCAA 
AGTACCAAGACGTGAACACTTGGCAGCACAGGCTGCAGCTGGCCATGGAGTACGTCAGCA 
TCATTAACTATCTGCATCACAGCCCCCTGGGCACGAGGGTCATGTGTGACTCTAACGACC 
TGCCCAAAACATTGTCCCAGTACCTGCTAACAAGTAACTTCAGCATTGTGGCAAACGACC 
TGGACGCTCTGCCCCTGGTAGACCATGACTCTGGGGTACTTATAAAGTGTGGCCACAGAG 
AGCTCCATGGGGATTTTGTGGCTCCAGAGCAGCTGTGGCCCTACGGAGAAGACACGCCCT 
TCCAAGACGATCTCATGCCTTCCTACAATGAGAAGGTTGACATCTGGAAGATTCCAGATG 
TCTCCAGTTTCCTCTTGGGGCACGTGGAAGGGAGTGATATGGTTAGATTCCATTTGTTTG 
ATATCCATAAGGCGTGCAAGAGCCAGATCCCGGCAGAAAGACCCACTGCTCAGAACGTGC 
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TAGACGCTTACCAGAGGGTTTTCCATTCACTCCGAGACACTGTGATGTCGCAGACGAAAG 
AAATGCTGTAAAAATGAGCCATCGAGTGACGTGCTTGATGGCTGAATGGCATCCCAGCTG 
TTCCGCTCTTGATGATGGAAGAGCTTTGCATGGATGGATGTTGACCCTGGCTGTTCAGCC 
ACGTAGGCCTCCTCTACGTCTGCCTGCATGTTTGAGTGTTCTGCTCTCCTGGCAGCCCGG 
ATGGAAGCTGCCAAGCGAGAAAGCCTGGCTTCAGGATGCTCCCTGGTGAAGATGCAGAGG 
ATTCTGGATCTGCATAGTTTCAAGGGAGTGATCAAACGGTGACCTTGAAGACATGCTGCC 
TGCCTTGGTAACTTTTTATAGACTAGTAGGAAACAGAAATCTTTTGGGGGAGGGGGGGAC 
AACCCACTAGTTCCTCAGAGACAATTTCTTCTCATTCAGAAAGCCCTGTTGGAAGCTGGG 
GATGTTTTAACTCCGTGGCAGGGCACTTGCCTAGTTGTGTGCAAAGCCTTGGATCTGACC 
CATGGCATGTGCACACACACAAATGCTCAAAGAAAATCCCAGACGCCAGAAGTGTGCCCC 
TTTCTTGTCAATAAGGTCATTGTCAGTACCGGAGATGATTTTTTTTATGAAGCGTTTATG 
CTGACTCGTGTCACTGAGCCAAGTGTGCATGGTCGTTAGCTACTTTGTGGGTTCTTCTTT 
CTTTCTACCCTACTTCTTCCCTTTCCACCCCCTT^ACACTAGATAGGAGAGAGGAGAGAGA 
AAGGAAAGTGGGCACTGTTATATTGTTGGACGACTTCTTGCTGATTAAGGGGTGTCGAGT 
TCCTTGGAGCAATGATCTTTGCTGCCAAGATATCTCATTTCTTCTTGTTTCTTCTTCGCC 
CACGACCACTTCACAAACACCGACCAACAGCAAACAACAACCCACCCCGCTTCTCGGGGG 
CCCTAGCACTTATGTACTTCTGT^AAAGTCCCCAGAAATTCCAATCATCACACACTCAGAG 
AAACTGTCTGCTGCTGGCAAAACTACACCCCTGCTAGAGCATGAGGCAAATCATAGTCAG 
CTGCTGTGGACAGTCTGAAGCAGCCTGGCATCCCACACCTGAGATTAAAACAAAAACATT 
CTTACCTGTGTTTTGTTTTTGTTTTAAGAAACCAAAGTGCACCAAGATAGCATGCTCTTG 
AGATTGTGGCTGTCTAGAGATTTTTGGAACAGCAAGTTGAAGGAACTTTCTTACCTGCCT 
TGAATGGTGCTTTGAACTTCCTGCTGACCTGGAGTTTCTGTGTGAATATTTCTATCCAGT 
GTCCCCCTGTACCGGAAAGTACAAAGTCTGCTCTGGGCTTGCATGCCTGAACACTTTAAA 
ACACTGTGGAGCCAGGAATAATGGTACCCACCTGTAATCCCAGCACCTGGGAGACAGGAG 
GAACCAGGAGTTCAGGGTTATCCTGGGCTATATACCGTGACCCTGTCTACCCCCACACCC 
CAATAAAAAAACAAAAAGGTC 

SEQ ID NO: 104_SGK071_2_H 

GAGGTGGTGGCTGTGCAGATGATGGTGGAATGCATGGATGACCATTACGCCAGTCAGGCC 
CTGGAGGAGCTGATGCCACTGCTGAAGCTGCGGCACGCCCACATCTCTGTGTACCAGGAG 
CTGTTCATCACGTGGAATGGGGAGATCTCTTCTCTGTACCTCTGCCTGGTGATGGAGTTC 
AATGAGCTCAGCTTCCAGGAGGTCATTGAGGATAAGAGGAAGGCAAAGAAAATCATTGAC 
TCTGAGTGGATGCAGAATGTGCTGGGCCAGGTGCTGGACGCGCTGGAATACCTGCACCAT 
TTGGACATCATCCACAGGAATCTCAAACCCTCCAACATCATCCTCATCAGCAGTGACCAC 
TGC7VAACTGCAGGACCTGAGTTCCAATGTGCTAATGACAGACAAAGCCAAATGGAATATT 
CGTGCGGAGGAAGACCCCTTTCGTAAGTCCTGGATGGCCCCTGAAGCCCTCAACTTCTCC 
TTCAGCCAGAAATCAGACATCTGGTCCCTGGGCTGCATCATTCTGGACATGACCAGCTGC 
TCCTTCATGGATGGCACAGAAGCCATGCATCTGCGGAAGTCCCTCCGCCAGAGCCCAGGC 
AGCCTGAAGGCCGTCCTGAAGACAATGGAGGAGAAGCAGATCCCGGATGTGGAAACCTTC 
AGGAATCTTCTGCCCTTGATGCTCCAGATCGACCCCTCGGATCGAATAACGATAAAGGAC 
GTGGTGCACATCACCTTCTTGAGAGGCTCCTTCAAGTCCTCGTGCGTCTCTCTGACCCTG 
CACCGGCAGATGGTGCCTGCGTCCATCACCGACATGCTGTTAGAAGGCAACGTGGCCAGC 
ATTTTAGGTGATGCTGGGGACACAAAGGGGGAGCGTGCCCTGAAGCTCCTGTCCATGGCC 
TTGGCATCCTATTGTTTAGTTCCAGAGGGTTCATTATTTATGCCCCTGGCCTTGCTCCAC 
ATGCACGACCAGTGGCTCAGCTGTGACCAGGACAGAGTCCCTGGGAAGAGAGACTTTGCC 
TCCCTGGGGAAACTAGGGAAGCTGTTGGGCCCCATCCCAAAGGGTCTGCCGTGGCCCCCG 
GAGCTGGTGGAGGTGGTGGTCACGACCATGGAGCTACATGACAGGGTCCTCGATGTCCAG 
CTGTGTGCCTGCTCCCTGCTGCTGCACCTCCTGGGCCAAGCGCTGGTGCACCACCCGGAA 
GCCAAGGCTCCCTGCAACCAAGCCATCACCTCCACCCTGCTGAGTGCTCTTCAGAGCCAC 
CCCGAGGAGGAGCCACTTCTTGTCATGGTCTACAGCCTGCTAGCCATCACCACAACCCAG 
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GAGTCAGAGTCACTGTCAGAGGAGCTGCAGAACGCTGGGCTGCTGGAGCACATCCTGGAG 
CACCTCAACAGCTCCCTCGAAAGCAGGGACGTCTGCGCCAGCGGCCTGGGCCTGCTCTGG 
GCCCTCCTGCTGGACGACCCCATCTTGGCACTCCAGCGCCCCAGGAAAAAGAGAGCTCCA 
AACCACGGAAAGCCCGGGAAACCCAAGAACCCTGCCAGCACCCAAAGTATCATTGTGAAC 
AAGGCCCCCTTGGAGAAGGTCCCGGACCTCATCAGCCAGGTGTTGGCCACCTACCCTGCG 
GATGGGGAAATGGCAGAAGCCAGCTGCGGAGTCTTCTGGCTGCTGTCCCTGCTGGGCTGC 
ATCAAGGAGCAGCAGTTTGAACAAGTGGTGGCGCTGCTCCTGCAAAGCATCCGGCTGTGC 
CAGGACAGAGCCCTGCTGGTGAACAATGCCTACCGGGGACTGGCCAGCCTGGTGAAGGTG 
TCAGAGCTGGCGGCCTTCAAGGTGGTGGTGCAGGAGGAGGGCGGCAGTGGCCTCAGCCTC 
ATCAAGGAGACCTACCAGCTCCACAGGGACGACCCGGAGGTGGTGGAGAACGTGGGCATG 
CTGCTGGTCCACCTGGCTTCCTATGAGGAGATCCTGCCGGAGCTGGTGTCCAGTAGTATG 
AAGGCCCTGCTCCAGGAGATCAAGGAGCGCTTCACCTCCAGCCTGGTGAGTGACAGCAGC 
GCCTTCAGCAAACCAGGCCTCCCTCCAGGTGGAAGCCCCCAGCTGGGGTGCACCACGTCT 
GGGGGACTGGAATAG 

SEQ ID NO: 105_AA118352_M SGK071_M 

CAGAAGAAGACCCCTGCCAGAAGTCCTGGATGGCTCCTGAAGCTCTCAAATTCTCCTTCT 
CCACCAAATCCGACATCTGGTCTCTGGGCTGCATCATTCTAGACATGGCCACTTGCTCCT 
TCCTGAACGACACAGAAGCCATGCAACTGCGGAAGGCCATCCGCCATCATCCAGGCAGCG 
TGAAGCCCATCCTGAAAACCATGGAGGAGAAGCAAATCCCTGGTACAGATGTCTACTATT 
TGCTTCTGCCCTTCATGTTGCATATCAACCCCTCCGATCGACTGGCAATCAAGGATGTGA 
TGCAAGTCACCTTCATGAGCAACTCCTTCAAAAGCTCCTCTGTTGCGCTGAATATGCAGC 
GGCAGAAGGTCCCCATCTTCATCACTGACGTGCTGCTTGAAGGCAACATGGCCAACATCT 
TAGGCAGCTGGCTGTGTGCTTCCTTTGTGAACGACAGCAGGCACTGTGACTCAGGGATTG 
GCTCGCAGAGACTTGGGTTTGATTTTCAGTCAGTCTCTTGGACAGAGCACCCTCTGAAAG 
ATGTCATGCAGAATTTCTCCAGTCGACCAGAGGTCCAGCTCAGAGCCATTAACAAGTTGT 
TGACAATGCCAGAGGACCAGCTAGGGCTGCCATGGCCCACAGAGCTGCTGGAAGAGGTGA 
TCAGCATCATAAAGCAGCATGGGCGGATCCTGGATATTCTGCTCAGCACCTGCTCCCTTC 
TGCTGCGTGTTCTTGGCCAAGCACTGGCAAAGGACCCAGAAGCTGAGATCCCAAGGAGCA 
GTTTGATCATCTCCTTCCTGATGGATACCTTGCGGAGCCATCCTAACTCTGAAAGGCTTG 
TTAATGTGGTCTACAACGTGCTTGCCATTATTTCCAGCCAAGGACAGATCTCAGAAGAGC 
TGGAAGAGGAGGGGTTGTTTCAGCTTGCCCAAGAGAACCTGGAGCACTTCCAAGAGGACA 
GGGACATCTGCCTCTCTATCCTGAGCCTGCTCTGGTCCCTCCTGGTAGATGTTGTCACTG 
TGGACAAAGAGCCCTTGGAGCAGCTCTCTGGCATGGTCACCTGGGTGCTGGCTACTCATC 
CGGAGGACGTGGAAATAGCAGAGGCTGGCTGTGCGGTGCTCTGGCTGCTGTCCTTGTTGG 
GCTGCATAAAGGAGAGTCAGTTTGAGCAGGTGGTAGTGCTGCTCCTGAGAAGCATCCAGC 
TGTGCCCTGGCAGAGTACTGCTGGTGAACAATGCATTCCGTGGCTTGGCCAGCCTCGCAA 
AGGTGTCCGAACTGGTGGCCTTCCGAATAGTAGTACTGGAAGAGGGCAGCAGCGGCCTCC 
ACCTCATCCAAGATATCTACAAGCTCTACAAGGATGACCCTGAGGTGGTGGAGAACCTCT 
GCATGCTGTTGGCCCATCTGACCTCCTACAAGGAGATCCTGCCAGAGATGGAGTCTGGAG 
GCATCAAAGACCTAGTCCAGGTGATCCGGGGGCGCTTTACCTCCAGCCTGGAGCTGATTT 
CTTACGCTGATGAGATACTCCAGGTACTGGAAGCAAATGCACAACCTGGCGTCCAGGAGG 
ATCAGCTTGAGCCTCCTGCAGGGCAGGAAGCCCCACTGCAGGGAGAGCCCCTCTTCAGGC 
CCTGACATGCTGCCCTTCTGGTCCTGTGGTAAGAGAAAGTATCACTAGGTCCAGTATTAA 
TTTCGTACCCCATGGTGACTAATAAAAGAAGCCCTAGGCTGTTTCTGGC 

SEQ ID NO: 106_018653 . 9_H 

GGCCGGGGTCGGGGCGCGGGGCATGCGCGCGGGCTGGGCAGGGGGCCGGCGGGGCGCAGA 
GCGGAGCCGCCTCGGAGCCTGAGCCGCCCGGGGCCGGGGCCGGGGAGCCGCGCGGGGCCG 
GCCGGCCGGGGGGAGGGGAGCGATGCGGCGCCGGCGGGCGGCAGTGGCCGCGGGTTTCTG 
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CGCCTCCTTCCTGCTGGGCTCCGTCCTCAACGTGCTCTTCGCTCCGGGTCGGAGCCTCCG 

aggccaggccagtcccctgagccttcgccg'gccccgggtgcgggccgtcgcgggggccgc 

GGGGAGCTGGCCCGGCAGATCCGGGCGCGCTACGAGGAGGTGCAGCGCTATTCCCGCGGG 

GGCCCCGGGCCCGGGGCGGGCCGGCCGGAGCGGCGGCGCCTGATGGACCTGGCTCCGGGC 

GGGCCCGGCCTGCCGCGCCCCCGGCCCCCTTGGGCCCGGCCCCTGTCCGACGGCGCCCCA 

GGCTGGCCCCCGGCTCCCGGCCCAGGCTCCCCCGGCCCGGGCCCGCGCCTGGGCTGCGCC 

GCGCTTCGCAACGTGTCCGGCGCGCAGTACATGGGCTCAGGCTACACCAAGGCCGTGTAC 

CGGGTCCGCCTGCCCGGCGGTGCCGCGGTGGCGCTCAAGGCGGTGGACTTTAGCGGCCAC 

GATCTGGGCAGCTGCGTGCGCGAGTTCGGGGTACGGAGGGGCTGCTATCGGCTGGCGGCC 

CACAAGCTGCTTAAGGAGATGGTGCTGCTGGAGCGGCTGCGGCACCCCAACGTGCTGCAG 

CTCTATGGCTACTGCTACCAGGACAGCGAGGACATCCCAGACACCCTGACCACCATCACG 

GAGCTGGGCGCCCCTGTAGAAATGATCCAGCTGCTGCAAACTTCCTGGGAGGATCGATTC 

CGAATCTGCCTGAGCCTGGGCCGCCTCCTCCACCACCTGGCCCACTCCCCACTGGGCTCC 

GTCACTCTGCTGGACTTCCGCCCTCGGCAGTTTGTGCTGGTGGATGGGGAGCTCAAAGTG 

ACGGACCTGGATGACGCACGTGTGGAGGAGACGCCGTGTGCAGGCAGCACCGACTGCATA 

CTCGAGTTTCCGGCCAGGAACTTCACCCTGCCCTGCTCAGCCCAGGGCTGGTGCGAGGGC 

ATGAACGAGAAGCGGAACCTCTATAATGCCTACAGGTTTTTCTTCACATACCTCCTGCCT 

CACAGTGCCCCGCCTTCACTGCGTCCTCTGCTGGACAGCATCGTCAACGCCACAGGAGAG 

CTCGCCTGGGGGGTGGACGAGACCCTGGCCCAGCTGGAGAAGGTGCTGCACCTGTACCGG 

AGCGGGCAGTATCTGCAGAACTCCACGGCAAGCAGCAGTACCGAGTACCAGTGTATCCCA 

GACAGCACCATCCCCCAGGAAGACTACCGCTGCTGGCCATCCTACCACCACGGGAGCTGC 

CTCCTTTCAGTGTTCAACCTGGCTGAGGCTGTGGATGTCTGTGAGAGCCATGCCCAGTGT 

CGGGCCTTTGTGGTCACCAACCAGACCACCTGGACAGGTCGGCAGCTGGTCTTTTTCAAG 

ACTGGATGGAGCCAAGTGGTCCCTGATCCCAACAAGACCACATATGTGAAGGCCTCTGGC 

TGACCTATCTGAGGGCTCGGCTGACCAGCTGACTATCCTCAGCAGCTGGGCTTGCCTGTG 

GAGGGAGTGACTTGCACTGGCAGCACTGCATGTCACCTGGGAACCCCTGCAGACAAAGCT 

AACATCCCAGACAGACAGATGTGACCAGGACAAACGTGCAATAATGCCAAATGTTAAAAT 

GTGAGTTTACCAGCCTAGCTATGGGACTGCTGGCTCCTAGTCCAGGAATCATGGGGGTAT 

GACTGCCTCTCCAACCCTGTGGGCTGTAAGCAAGCTCAGGCTAGTCTCCCCACTGGGGGC 

TGTGCCCCTCCCTGGGACGGTTCCGTGGGCAGCCCCATCACTGTGTTCAATAGTGTGAGA 

ATGTAGCTAAAGCCCCTGCTGCTGCTGCTGCACATGCCACAGCAGGCGGTGGGGGCTGCG 

TGGGGACAATCCATCGTGGAGTGTTCTCTCAGCTTAGGTCTGGACAGGAGACTTGGCGGG 

AGATGCTCCAGGATGTGGGTGATTCTGTACCTGGGGAGGCTATCTCTGACCTCCCGACAG 

GGGACACTCCCAGGCCAGCCCAGGGGTCAGGGGCAGAGGTGCACACCTCAGCATGAGCCA 

AGACTGGGGTCAGGGAGCAGGTGTGGTTTGAGCCAGGACCTGGGGCGGGGGTGGGGCCGG 

GGCCTTTCTGCCTCATTTGCTTTCAATGAAAGCCTCAAAGCAGCCAAAACCAGGCTTTCC 

CCCTTCCTCGAGTTTGAATATCCAGAATCTTTTGTACTTCTTGTTGGTTAAATTGTTTAT 

TTTTGTAAAAAATAAAATAAAATTAGTTAATAAAATGATGTTTCACAGCAAACTCTTCCC 



SEO ID NO: 107 AA396601_M 

CCACGCGTCCGGGCTGCGCCGCGCTCCGCAACGTGTCTGGCGCGCAGTACGTGGGCTCAG 
GCTACACTAAGGCTGTGTACCGGGTCCGCCTGCCCGGCGGCGCCGCGGTGGCGCTTAAAG 
CAGTGGACTTCAGCGGCCACGATCTGGGCAGCTGCGTGCGCGAGTTCGGGGCGCGAAGGG 
GCTGCTATCGCCTGGCGGCCCACAAGCTGCTCAAAGAGATGGTGCTGCTGGAGCGGCTGC 
GGCACCCCAACGTGCTGCAGCTCTATGGCTATTGCTACCAGGACAGTGAGGGCATCCCAG 
ACACGCTGACCACCATCACAGAGCTGGGTGCCCCTGTGGAGATGATCCAGCTGTTGCAGA 
CTTCCTGGGAGGATCGATTCCGAATCTGCCTCAGCCTTGGCCGCCTCCTCCACCACCTGG 
rCCACTCCCCGCTGGGCTCGGTCACCCTGCTTGACTTCCGCCCTCGGCAGTTTGTGCTAG 
TGAACGGGGAGCTGAAAGTGACAGACCTGGATGATGCCCGCGTGGAAGAGACACCGTGCA 
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CCAGCAGTGCCGACTGCACGCTAGAGTTTCCAGCCAGGAACTTCAGCCTGCCCTGCTCGG 
CCCAGGGCTGGTGCGAGGGCATGAATGAGAAACGGAACCTCTACAATGCCTACAGGTTCT 
TCTTCACATACCTCCTGCCACACAGTGCCCCGCCTTCCCTCCGACCTCTCCTGGATAGCA 
TCGTCAATGCCACGGGAGAGCTCGCCTGGGGGGTGGATGAGACCCTGGCCCAGCTGGAGA 
CAGCGCTACACTTGTTCCGAAGTGGGCAGTACCTGCAGAACTCTACAAGCAGCAGGGCTG 
AGTACCAGCGCATCCCGGACAGTGCCATCACACAGGAGGACTATCGCTGCTGGCCATCCT 
ATCACCACGGCGGCTGCCTCCTGTCCGTGTTCAACCTGGCTGAGGCTATAGATGTCTGTG 
AGAGCCATGCTCAGTGTCGTGCCTTTGTGGTCACCAACCAGACCACCTGGACAGGTCGGA 
AGCTGGTCTTTTTTAAGACTGGATGGAACCAAGTGGTCCCTGATGCCGGCAAGACCACAT 
ATGTGAAGGCCCCTGGTTGACTGGTTGTGGGCTCAGCTGACCAGCTGGGCTTGCCTGCTG 
CAGGCGTGACTTGCATCCCACCTGGGAACCCCTGCAGACAAAAGCTAGCTCCCAGAGCAA 
CTGATGTGACCAGGACAAAACGTGCAATATGCAAAAATGTTAAAATGTGAGTTTGCCAGC 
TTCAGTCCCAGACTGGTTGGAACCCGATTGCCTCTCTGGAGCTGTAGGCTGTGAGCAGGG 
CTCAGGCTGGTCTTAACTGGGACAGTCCCGTGGGCAGCCCATTACTGCATTCATGCTTTG 
AGAATGTAGCCAGAACACTGCTGCTGCATAAGCCACCGTGGGCAGGAGCTGCCTGGGGAC 
AACCAGTCTCAGAGTGCTCTCTCAGCTCAGCTCCGCTCCAAATGGAGAGCGCGGGATGCG 
GAGATGTGAGTGAACCAGCACTGGGAAGAAGGCTCTCGGGCCTCTCCCTAGAGGTTGCTC 
CTAGGCCAGCCCCGAGGCCGTGGGCAGCAGTGCTCGCATCCATATGAGCCAAGACTAGAG 
TGGAGGAGCAGATTGCATTTGAGCCAGGACTGGGGTGGGGGTAGGGTCGGGGCCTCTCTG 
CCTCATTTGCTTTCAGTGAAAGCCAGGGAGCAGCCGCAGCCAGGCTCCTCCCACTCCTGG 
AGGCCAGGCTCCTCCCCCTCCTGGAGGCCAGGCTCCTCCCCCTCCTGGAGTTTGCGTACC 
CAGAAGCTTTTATACTTCTCGTTCATTAAATTGTTTATTTTTGTAAAAAAAAAAATTAAT 
CAATTAATAAAATGATGTTTTGTGAC 

SEQ ID NO: 108_VRK3_H 

ATGATCTCCTTCTGTCCAGACTGTGGCAAAAGTATCCAAGCGGCATTCAAATTCTGCCCC 
TACTGTGGAAATTCTTTGCCTGTAGAGGAGCATGTAGGGTCCCAGACCTTTGTCAATCCA 
CATGTGTCATCCTTCCAAGGCTCAAAGAGAGGGCTGAACTCCAGTTTTGAAACCTCTCCT 
AAGAAAGTGAAATGGTCCAGCACCGTCACCTCTCCCCGATTATCCCTCTTCTCAGATGGT 
GACAGTTCTGAGTCTGAAGATACTCTGAGTTCCTCTGAGAGATCCAAAGGCTCCGGGAGC 
AGACCCCCAACCCCCAAAAGCAGCCCTCAGAAGACCAGGAAGAGCCCTCAGGTGACCAGG 
GGTAGCCCTCAGAAGACCAGCTGTAGCCCTCAGAAGACCAGGCAGAGCCCTCAGACGCTG 
AAGCGGAGCCGAGTGACCACCTCACTTGAAGCTTTGCCCACAGGGACAGTGCTGACAGAC 
AAGAGTGGGCGACAGTGGAAGCTGAAGTCCTTCCAGACCAGGGACAACCAGGGCATTCTC 
TATGAAGCTGCACCCACCTCCACCCTCACCTGTGACTCAGGACCACAGAAGCAAAAGTTC 
TCACTCAAACTGGATGCCAAGGATGGGCGCTTGTTCAATGAGCAGAACTTCTTCCAGCGG 
GCCGCCAAGCCTCTGCAAGTCAACAAGTGGAAGAAGCTGTACTCGACCCCACTGCTGGCC 
ATCCCTACCTGCATGGGTTTCGGTGTTCACCAGGACAAATACAGGTTCTTGGTGTTACCC 
AGCCTGGGGAGGAGCCTTCAGTCGGCCCTGGATGTCAGCCCAAAGCATGTGCTGTCAGAG 
AGGTCTGTGCTGCAGGTGGCCTGCCGGCTGCTGGATGCCCTGGAGTTCCTCCATGAGAAT 
GAGTATGTTCATGGAAATGTGACAGCTGAAAATATCTTTGTGGATCCAGAGGACCAGAGT 
CAGGTGACTTTGGCAGGCTATGGCTTCGCCTTCCGCTATTGCCCAAGTGGCAAACACGTG 
GCCTACGTGGAAGGCAGCAGGAGCCCTCACGAGGGGGACCTTGAGTTCATTAGCATGGAC 
CTGCACAAGGGATGCGGGCCCTCCCGCCGCAGCGACCTCCAGAGCCTGGGCTACTGCATG 
CTGAAGTGGCTCTACGGGTTTCTGCCATGGACAAATTGCCTTCCCAACACTGAGGACATC 
ATGAAGCAAAAACAGAAGTTTGTTGATAAGCCGGGGCCCTTCGTGGGACCCTGCGGTCAC 
TGGATCAGGCCCTCAGAGACCCTGCAGAAGTACCTGAAGGTGGTGATGGCCCTCACGTAT 
GAGGAGAAGCCGCCCTACGCCATGCTGAGGAACAACCTAGAAGCTTTGCTGCAGGATCTG 
CGTGTGTCTCCATATGACCCCATTGGCCTCCCGATGGTGCCCTAG 
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SEQ ID NO: 109_S71575_M VRK3_M 

CCATCCCCACCTGTATCGGCTTTGGCATTCACCAGGACAAGTACAGGTTCCTAGTATTCC 
CCAGCCTGGGGAGGAGCCTTCAGTCAGCCCTGGATGACAACCCAAAGCATGTGGTATCAG 
AGAGATGTGTGCTTCAGGTGGCCTGCAGGCTGCTGGATGCTCTGGAGTATCTCCATGAAA 
ATGAGTATGTTCACGGGAACCTGACAGCTGAGAATGTCTTTGTGAATCCAGAGGATCTGA 
GCCAGGTGACCCTGGTGGGCTATGGCTTCACCTACCGATACTGCCCAGGTGGCAAACACG 
TGGCCTACAAAGAAGGCAGCAGGAGTCCACACGATGGGGACTTGGAGTTCATTAGCATGG 
ACCTGCACAAGGGATGCGGACCCTCCCGCCGCAGCGATCTCCAGACCTTGGGCTACTGTA 
TGCTCAAGTGGCTTTATGGGTCCCTGCCATGGACAAATTGCCTTCCCAACACCGAAAAGA 
TAACTAGGCAGAAGCAGAAGTATCTGGACAGCCCCGAGCGCCTCGTGGGACTGTGTGGCC 
GCTGGAACAAGGCCTCAGAGACCCTGCGGGAGTACCTGAAGGTGGTGATGGCCCTCAATT 
ATGAGGAGAAGCCACCCTATGCCACGCTGAGGAACAGCCTAGAAGCTCTGCTGCAGGATA 
TGCGGGTGTCACCCTATGACCCTCTGGACCTCCAGATGGTGCCTTAGATGGAATCCAGAG 
CTTCCGACTTGCAGCTTGAAGTAGAACATGAAGTAGTGTGACTGGAGGCCTGTTTGAACT 
CATAGCTCCTAAAAGAATCCCTTGAATGTGCATTCTCACCGCTCCCTTAGGACATATGAA 
TCAGCACTTGTGTTGGGGAACCTGAGTCATGTCATGTAATGTGAAACTCCTCCCTGTCTC 
AGCTCTGGCAGCTGTGGATGGAGGTAAGTGGATGCTGGCGGCGGCGGCGGCAGCAGCCAC 
TCCACTCCCTATGGCATTTCTGTGATGGCATAATAAACTGTTTTTAATC 

SEQ ID NO: 110_AA4 5427_H 

ATGGGCCACGCGCTGTGTGTCTGCTCTCGGGGAACTGTCATCATTGACAATAAGCGCTAC 
CTCTTCATCCAGAAACTGGGGGAGGGTGGGTTCAGCTATGTGGACCTAGTGGAAGGGTTA 
CATGATGGACACTTCTACGCCCTGAAGCGAATCCTGTGTCACGAGCAGCAGGACCGGGAG 
GAGGCCCAGCGAGAAGCCGACATGCATCGCCTCTTCAATCACCCCAACATCCTTCGCCTC 
GTGGCTTACTGTCTGAGGGAACGGGGTGCTAAGCATGAGGCCTGGCTGCTGCTACCATTC 
TTCAAGAGAGGTACGCTGTGGAATGAGATAGAAAGGCTGAAGGACA7VAGGCAACTTCCTG 
ACCGAGGATCAAATCCTTTGGCTGCTGCTGGGGATCTGCAGAGGCCTTGAGGCCATTCAT 
GCCAAGGGTTATGCCCACAGAGACTTGAAGCCCACCAATATATTGCTTGGAGATGAGGGG 
CAGCCAGTTTTAATGGACTTGGGTTCCATGAATCAAGCATGCATCCATGTGGAGGGCTCC 
CGCCAGGCTCTGACCCTGCAGGACTGGGCAGCCCAGCGGTGCACCATCTCCTACCGAGCC 
CCAGAGCTCTTCTCTGTGCAGAGTCACTGTGTCATCGATGAGCGGACTGATGTCTGGTCC 
CTAGGCTGCGTGCTATATGCCATGATGTTTGGGGAAGGCCCTTATGACATGGTGTTCCAA 
AAGGGTGACAGTGTGGCCCTTGCTGTGCAGAACCAACTCAGCATCCCACAAAGCCCCAGG 
CATTCTTCAGCATTGCGGCAGCTCCTGAACTCGATGATGACCGTGGACCCGCATCAGCGT 
CCTCACATTCCTCTCCTCCTCAGTCAGCTGGAGGCGCTGCAGCCCCCAGCTCCTGGCCAA 
CATACTACCCAAATCTGA 

SEQ ID NO: 111_H05721_H 

CCCTGAGGCACCGCCCCAAGTTTGGTGTGACCGGCGGGGGACGCCGGTGGTGGCGGCAGC 
GACGGCTGCGGGGGCACCGGGCCGCGGCGCCACCATGGCGGTGCGACAGGCGCTGGGCCG 
CGGCCTGCAGCTGGGTCGAGCGCTGCTGCTGCGCTTCACGGGCAAGCCCGGCCGGGCCTA 
CGGCTTGGGGCGGCCGGGCCCGGCGGCGGGCTGTGTCCGCGGGGAGCGTCCAGGCTGGGC 
CGCAGGACCGGGCGCGGAGCCTCGCAGGGTCGGGCTCGGGCTCCCTAACCGTCTCCGCTT 
CTTCCGCCAGTCGGTGGCCGGGCTGGCGGCGCGGTTGCAGCGGCAGTTCGTGGTGCGGGC 
CTGGGGCTGCGCGGGCCCTTGCGGCCGGGCAGTCTTTCTGGCCTTCGGGCTAGGGCTGGG 
CCTCATCGAGGAAAAACAGGCGGAGAGCCGGCGGGCGGTCTCGGCCTGTCAGGAGATCCA 
GGCAATTTTTACCCAGAAAAGCAAGCCGGGGCCTGACCCGTTGGACACGAGACGCTTGCA 
GGGCTTTCGGCTGGAGGAGTATCTGATAGGGCAGTCCATTGGTAAGGGCTGGAGTGCTGC 
TGTGTATGAAGCCACCATGCCTACATTGCCCCAGAACCTGGAGGTGACAAAGAGCACCGG 
GTTGCTTCCAGGGAGAGGCCCAGGTACCAGTGCACCAGGAGAAGGGCAGGAGCGAGCTCC 
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GGGGGCCCCTGCCTTCCCCTTGGCCATCAAGATGATGTGGAACATCTCGGCAGGTTCCTC 
CAGCGAAGCCATCTTGAACACAATGAGCCAGGAGCTGGTCCCAGCGAGCCGAGTGGCCTT 
GGCTGGGGAGTATGGAGCAGTCACTTACAGAAAATCCAAGAGAGGTCCCAAGCAACTAGC 
CCCTCACCCCAACATCATCCGGGTTCTCCGCGCCTTCACCTCTTCCGTGCCGCTGCTGCC 
AGGGGCCCTGGTCGACTACCCTGATGTGCTGCCCTCACGCCTCCACCCTGAAGGCCTGGG 
CCATGGCCGGACGCTGTTCCTCGTTATGAAGAACTATCCCTGTACCCTGCGCCAGTACCT 
TTGTGTGAACACACCCAGCCCCCGCCTCGCCGCCATGATGCTGCTGCAGCTGCTGGAAGG 
CGTGGACCATCTGGTTCAACAGGGCATCGCGCACAGAGACCTGAAATCCGACAACATCCT 
TGTGGAGCTGGACCCAGACGGCTGCCCCTGGCTGGTGATCGCAGATTTTGGCTGCTGCCT 
GGCTGATGAGAGCATCGGCCTGCAGTTGCCCTTCAGCAGCTGGTACGTGGATCGGGGCGG 
AAACGGCTGTCTGATGGCCCCAGAGGTGTCCACGGCCCGTCCTGGCCCCAGGGCAGTGAT 
TGACTACAGCAAGGCTGATGCCTGGGCAGTGGGAGCCATCGCCTATGAAATCTTCGGGCT 
TGTCAATCCCTTCTACGGCCAGGGCAAGGCCCACCTTGAAAGCCGCAGCTACCAAGAGGC 
TCAGCTACCTGCACTGCCCGAGTCAGTGCCTCCAGACGTGAGACAGTTGGTGAGGGCACT 
GCTCCAGCGAGAGGCCAGCAAGAGACCATCTGCCCGAGTAGCCGCAAATGTGCTTCATCT 
AAGCCTCTGGGGTGAACATATTCTAGCCCTGAAGAATCTGAAGTTAGACAAGATGGTTGG 
CTGGCTCCTCCAACAATCGGCCGCCACTTTGTTGGCCAACAGGCTCACAGAGAAGTGTTG 
TGTGGAAACT^AAAATGAAGATGCTCTTTCTGGCTAACCTGGAGTGTGAAACGCTCTGCCA 
GGCAGCCCTCCTCCTCTGCTCATGGAGGGCAGCCCTGTGATGTCCCTGCATGGAGCTGGT 
GAATTACTAAAAGAACATGGCATCCTCTGTGTCGTGATGGTCTGTGAATGGTGAGGGTGG 
GAGTCAGGAGACAAGACAGCGCAGAGAGGGCTGGTTAGCCGGAAAAGGCCTCGGGCTTGG 
CAAATGGAAGAACTTGAGTGAGAGTTCAGTCTGCAGTCCTCTGCTCACAGACATCTGAAA 
AGTGAATGGCCAAGCTGGTCTAGTAGATGAGGCTGGACTGAGGAGGGGTAGGCCTGCATC 
CACAGAGAGGATCCAGGCCAAGGCACTGGCTGTCAGTGGCAGAGTTTGGCTGTGACCTTT 
GCCCCTAACACGAGGAACTCGTTTGAAGGGGGCAGCGTAGCATGTCTGATTTGCCACCTG 
GATGAAGGCAGACATCAACATGGGTCAGCACGTTCAGTTACGGGAGTGGGAAATTACATG 
AGGCCTGGGCCTCTGCGTTCCCAAGCTGTGCGTTCTGGACCAGCTACTGAATTATTAATC 
TCACTTAGCGAAAGTGACGGATGAGCAGTAAGTAAGTAAGTGTGGGGATTTAAACTTGAG 
GGTTTCCCTCCTGACTAGCCTCTCTTACAGGAATTGTGAAATATTAAATGCAAATTTACA 
ACTGCAGATGACGTATGTGCCTTGAACTGAATATTTGGCTTTAAGAATGATTCTTCTTAT 
ACTCTGAAGGTGAGAATATTTTGTGGGCAGGTATCAACATTGGGGAAGAGATTTCATGTC 
TAACTAACTAACTTTATACATGATTTTTAGGAAGCTATTGCCTAAATCAGCGTC7VACATG 
CAGTAAAGGTTGTCTTCAACTGACAAAA 

SEQ ID NO: 112_AI086865_H 

AATGAGATGGAGAAGTACGAGCGGATCCGAGTGGTGGGGAGAGGTGCCTTCGGGATTGTG 
CACCTGTGCCTGCGAAAGGCTGACCAGAAGCTGGTGATCATCAAGCAGATTCCAGTGGAA 
CAGATGACCAAGGAAGAGCGGCAGGCAGCCCAGAATGAGTGCCAGGTCCTCAAGCTGCTC 
AACCACCCCAATGTCATTGAGTACTACGAGAACTTCCTGGAAGACAAAGCCCTTATGATC 
GCCATGGAATATGCACCAGGCGGCACTCTGGCTGAGTTCATCCAAAAGCGCTGTAATTCC 
CTGCTGGAGGAGGAGACCATCCTGCACTTCTTCGTGCAGATCCTGCTTGCACTGCATCAT 
GTGCACACCCACCTCATCCTGCACCGAGACCTCAAGACCCAGAACATCCTGCTTGACAAA 
CACCGCATGGTCGTCAAGATCGGTGATTTCGGCATCTCCAAGATCCTTAGCAGCAAGAGC 
ACCCCATGCTATATCTCCCCTGAGCTGTGTGAGGGCAAGCCCTACAACCAGAAGAGTGAC 
ATCTGGGCCCTGGGCTGTGTCCTCTACGAGCTGGCCAGCCTCAAGAGGGCTTTCGAGGCT 
GCGAACTTGCCAGCACTGGTGCTGAAGATCATGAGTGGCACCTTTGCACCTATCTCTGAC 
CGGTACAGCCCTGAGCTTCGCCAGCTGGTCCTGAGTCTACTCAGCCTGGAGCCTGCCCAG 
CGGCCACCACTCAGCCACATCATGGCACAGCCCCTCTGCATCCGTGCCCTCCTCAACCTC 
CACACCGACGGCAGAGAAGTCCGTGGCCCCCAGCAACACAGGGAGCAGGACCACCAGTGT 
CCGCTGCAGAGAGGCATCATCATGACATTCGGCAGCGGCAGCAATGGGTGCCTAGGCCAT 
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GGCAGCCTCACTGACATCAGCCAGCCCACCATTGTGGAGGCTTTGTTGGGCTATGAAATG 

GTGCAGCAAGTGGAGGAGGCCCTGAGCTTCACACTACTAGGCTCTGCACCCCTGGACCAG 

GAGCCTCTGCTGAGTATAGACCTGGGCACTGCTCACTCAGCTGCTGTGACTGGTGAGGAG 

GACTTGGGCTCTGGAGATGTAAACAGGTTACCCAGCTGGGAGAGAGGACATCTGCTGGCT 

GGTGTGGCGTCCAGCACTGATGTGTCTACCTTCTCTGAAGGTGACTGCAAGGAGCCTGAC 

AAGTGCTGCTGGAGACACAAGCAGTGCACTGGGCACATCATCTACCCTTTCGCCTCTGAC 

TGTGTCCGCCACAGCCTGCACCTACACTCTGTCAACCACTGCAACTGTAATTCTAGGCTG 

AAGGACTCTTCAGAGGATAGCAGCAGCTCCCGGGGCGCGGGCCCAACCTGCTCCCATGTC 

ATCGAGTCCCCTTGCTTTGAGCTCACACCGGAGGAGGAGCATGTGGAGCGATTCCGGTAT 

GGCTGGTGCAAAAGCTACAGACCTGTCTCTGTGGCAGTGATCCACCATCCACTCTACCAT 

GAGTGTGGGGCAGATGATCTAAATGXXAAGAAGAGGAAGAGGAGGAGGAGGAAAAGCAAG 

CCCCCCATCCCGACACAGGTGGGGCCCGCCACCGCCTCCCCTGACCTAGGCACCAGCATG 

GCCACTGGTACCCCTGACTCCACAGCGCCCATCACCATCTGGCGCTCTGAGAGCCCCACA 

GGGAAGGGTCAGGGCAGCAAGGTGATCAAGAAGGTAAAGAAGAAAAAGGAAAAAGAGAAA 

GACAAGGAGGAGATGGATGAGAAGGCAAAGCTGAAGAAAAAAGCCAAGAAAGGCCAGTTG 

ACTAAGAAGAAAAGCCCGGTTAAATTGGAGCCTTCCCCGCCAGACGTGAGCCGATCATTA 

AGCGCAAGACAGCTGGCCAGGATGTCCGAGTCCAGCCCAGAAAGCCGGGAAGAGCTGGAG 

AGCGAGGACAGTTACAATGGCCGGGGGCAGGGAGAACTGTCCAGCGAGGATATTGTGGAA 

TCATCATCGCCCAGGAAGAGAGAGAACACAGTCCAGGCCAAAAAGACAGGGGCAAAGCCC 

TCACAAGCCAGGAAGGTAAACAAGAGAAAATCTCCCCCAGGATCAAACCCCAACCTCAGT 

TGAGGCCAGGGTGGTCAGGGTGCAGAATAAATGCCATCGAGCCTGTGGCTGGCCCTCTGC 

TGCTGTTCTCTCCCTCCAACCTGGCTGTTTCTTGCGGGGCAAGGGGTGGGCTCAGGGCTG 

CAGGGGTTTCTCAAAGGCAATCCAGCTTTCACAAAGGAAGCCCATGGGAAGGCAGGTGGG 

AGGGAAAGGAAGGGGCACAGCCCTATTTCTTCCTACCTGCTAGGACAAGGTGGAAGAGTG 

TATCTGGGGTGGGAAGGAGGGCTTCCCCTCTCTGCTGCGAGAGACTGGTCTGTGTGAAAT 

CCACTTCTGGGACAGGCAGTACTGTCTGCAGCGATACCCCCAATAAACGGAACTTTTTAA 

CCC 

SEO ID NO: 113 AA836348_H 

ATGTCGGTGCTGGGCGAGTACGAGCGACACTGCGATTCCATCAACTCGGACTTTGGGAGC 

GAGTCCGGGGGTTGCGGGGACTCGAGTCCGGGGCCTAGCGCCAGTCAGGGGCCGCGAGCC 

GGCGGCGGCGCGGCGGAGCAGGAGGAACTGCACTACATCCCCATCCGCGTCCTGGGCCGC 

GGCGCCTTCGGGGAAGCCACGCTGTACCGCCGCACCGAGGATGACTCACTGGTTGTGTGG 

AAGGAAGTCGATTTGACCCGGCTGTCTGAGAAGGAACGTCGTGATGCCTTGAATGAGATA 

GTTATTCTGGCACTGCTGCAGCACGACAACATTATTGCCTACTACAATCACTTCATGGAC 

AATACCACGCTGCTGATTGAGCTGGAATATTGTAATGGAGGGAACCTGTATGACAAAATC 

CTTCGTCAGAAGGACAAGTTGTTTGAGGAAGAGATGGTGGTGTGGTACCTATTTCAGATT 

GTTTCAGCAGTGAGCTGCATCCATAAAGCTGGAATCCTTCATAGAGATATAAAGACATTA 

AATATTTTTCTGACCAAGGCAAACCTGATAAAACTTGGAGATTATGGCCTAGCAAAGAAA 

CTTAATTCTGAGTATTCCATGGCTGAGACGCTTGTGGGAACCCCATATTACATGTCTCCA 

GAGCTCTGTCAAGGAGTAAAGTACAATTTCAAGTCTGATATCTGGGCAGTTGGCTGCGTC 

ATTTTTGAACTGCTTACCTTAAAGAGGACGTTTGATGCTACAAACCCACTTAACCTGTGT 

GTGAAGATCGTGCAAGGAATTCGGGCCATGGAAGTTGACTCTAGCCAGTACTCTTTGGAA 

TTGATCCAAATGGTTCATTCGTGCCTTGACCAGGATCCTGAGCAGAGACCTACTGCAGAT 

GAACTTCTAGATCGCCCTCTTCTCAGGAAACGCAGGAGGTCAAGCACTGTGACTGAAGCA 

CCCATTGCTGTAGTAACATCACGAACCAGTGAAGTCTATGTTTGGGGTGGTGGAAAATCC 

ACCCCCCAGAAACTGGATGTTATCAAGAGTGGCTGTAGTGCCCGGCAGGTCTGTGCAGGG 

AATACCCACTTTGCTGTGGTCACAGTGGAGAAGGAACTGTACACTTGGGTGAACATGCAA 

GGAGGCACTAAACTCCATGGTCAGCTGGGCCATGGAGACAAAGCCTCCTATCGACAGCCA 

AAGCATGTGGAAAAGTTGCAAGGCAAAGCTATCCGTCAGGTGTCATGTGGTGATGATTTC 
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ACTGTCTGTGTGACTGATGAGGGTCAGCTCTATGCCTTCGGATCAGATTATTATGGCTGC 
ATGGGGGTGGACAAAGTTGCTGGCCCTGAAGTGCTAGAACCCATGCAGCTGAACTTCTTC 
CTCAGCAATCCAGTGGAGCAGGTCTCCTGTGGAGATAATCATGTGGTGGTTCTGACACGA 
AACAAGGAAGTCTATTCTTGGGGCTGTGGCGAATATGGACGACTGGGTTTGGATTCAGAA 
GAGGATTATTATACACCACAAAAGGTGGATGTTCCCAAGGCCTTGATTATTGTTGCAGTT 
CAATGTGGCTGTGATGGGACATTTCTGTTGACCCAGTCAGGCAAAGTGCTGGCCTGTGGA 
CTCAATGAATTCAATAAGCTGGGTCTGAATCAGTGCATGTCGGGAATTATCAACCATGAA 
GCATACCATGAAGTTCCCTACACAACGTCCTTTACCTTGGCCAAACAGTTGTCCTTTTAT 
AAGATCCGTACCATTGCCCCAGGCAAGACTCACACAGCTGCTATTGATGAGCGAGGCCGG 
CTGCTGACCTTTGGCTGCAACT^AGTGTGGGCAGCTGGGCGTTGGGAACTACAAGAAGCGT 
CTGGGAATCAACCTGTTGGGGGGACCCCTTGGTGGGAAGCAAGTGATCAGGGTCTCCTGC 
GGTGATGAGTTTACCATTGCTGCCACTGATGAGAAAGTATTGAATTCTAAGACCATCCGT 
TCCAATAGCAGTGGCTTATCCATTGGAACTGTGTTTCAGAGCTCTAGCCCGGGAGGAGGC 
GGCGGGGGCGGCGGTGGTGAAGAAGAGGACAGTCAGCAGGAATCTGAAACTCCTGACCCA 
AGTGGAGGCTTCCGAGGAACAATGGAAGCAGACCGAGGAATGGAAGGTTTAATCAGTCCC 
ACAGAGGCCATGGGGAACAGTAATGGGGCCAGCAGCTCCTGTCCTGGCTGGCTTCGAAAG 
GAGCTGGAAAATGCAGAATTTATCCCCATGCCTGACAGCCCATCTCCTCTCAGTGCAGCG 
TTTTCAGAATCTGAGAAAGATACCCTGCCCTATGAAGAGCTGCAAGGACTCAAAGTGGCC 
TCTGAAGCTCCTTTGGAACACAAACCCCAAGTAGAAGCCTCGGTAACTGAGCTTTTTGCC 
TTTGAATCACAACTAGTCACCTCGGCTGAATCCTGCAGTAACCTGTGCTGGGAAGGGAAC 
ACCACTGACTCCTCCTGCGTGTGCGTGCAGCTCTCTGCAGGTGGAGGTTGA 

SEQ ID NO: 114_R86668_H , MKK6_H 

ATGAACTTGCTGCTCTCCTACCGCGATGTGCAGGACTACTCGGCCATCATTGAGCTGGTG 
GAGACGCTGCAGGCCTTGCCCACCTGTGATGTGGCCGAGCAGCATAATGTCTGCTTCCAC 
TACACTTTTGCCCTCAACCGGAGGAACAGGCCTGGGGACCGGGCGAAGGCCCTGTCTGTG 
CTGCTGCCGCTGGTACAGCTTGAGGGCTCTGTGGCGCCCGATCTGTACTGCATGTGTGGC 
CGTATCTACAAGGACATGTTCTTCAGCTCGGGTTTCCAGGATGCTGGGCACCGGGAGCAG 
GCCTATCACTGGTATCGCAAGGCTTTTGACGTAGAGCCCAGCCTTCACTCAGGCATCAAT 
GCAGCTGTGCTCCTCATTGCTGCCGGGCAGCACTTTGAGGATTCCAAAGAGCTCCGGCTA 
ATAGGCATGAAGCTGGGCTGCCTGCTGGCCCGCAAAGGCTGCGTGGAGAAGATGCAGTAT 
TACTGGGATGTGGGTTTCTACCTGGGAGCCCAGATCCTCGCCAATGACCCCACCCAGGTG 
GTGCTGGCTGCAGAGCAGCTGTATAAGCTCAATGCCCCCATATGGTACCTGGTGTCCGTG 
ATGGAGACCTTCCTGCTCTACCAGCACTTCAGGCCCACGCCAGAGCCCCCTGGAGGGCCA 
CCACGCCGTGCCCACTTCTGGCTCCACTTCTTGCTACAGTCCTGCCAACCATTCAAGACA 
GCCTGTGCCCAGGGCGACCAGTGCTTGGTGCTGGTCCTGGAGATGAACAAGGTGCTGCTG 
CCTGCAAAGCTCGAGGTTCGGGGTACTGACCCAGTAAGCACAGTGACCCTGAGCCTGCTG 
GAGCCTGAGACCCAGGACATTCCCTCCAGCTGGACCTTCCCAGTCGCCTCCATATGCGGA 
GTCAGCGCCTCAAAGCGCGACGAGCGCTGCTGCTTCCTCTATGCACTCCCCCCGGCTCAG 
GACGTCCAGCTGTGCTTCCCCAGCGTAGGGCACTGCCAGTGGTTCTGCGGCCTGATCCAG 
GCCTGGGTGACGAACCCGGATTCCACGGCGCCCGCGGAGGAGGCGGAGGGCGCGGGGGAG 
ATGTTGGAGTTTGATTATGAGTACACGGAGACGGGCGAGCGGCTGGTGCTGGGCAAGGGC 
ACGTATGGGGTGGTGTACGCGGGCCGCGATCGCCACACGAGGGTGCGCATCGCCATCAAG 
GAGATCCCGGAGCGGGACAGCAGGTTCTCTCAGCCCCTGCATGAAGAGATCGCTCTTCAC 
AGACGCCTGCGCCACAAGAACATAGTGCGCTATCTGGGCTCAGCTAGCCAGGGCGGCTAC 
CTTAAGATCTTCATGGAGGAAGTGCCTGGAGGCAGCCTGTCCTCCTTGCTGCGGTCGGTG 
TGGGGACCCCTGAAGGACAACGAGAGCACCATCAGTTTCTACACCCGCCAGATCCTGCAG 
GGACTTGGCTACTTGCACGACAACCACATCGTGCACAGGGACATAAAAGGGGACAATGTG 
CTGATCAACACCTTCAGTGGGCTGCTCAAGATTTCTGACTTCGGCACCTCCAAGCGGCTG 
GCAGGCATCACACCTTGCACTGAGACCTTCACAGGAACTCTGCAGTATATGGCCCCAGAA 
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ATCATTGACCAGGGCCCACGCGGGTATGGGAAAGCAGCTGACATCTGGTCACTGGGCTGC 
ACTGTCATTGAGATGGCCACAGGTCGCCCCCCCTTCCACGAGCTCGGGAGCCCACAGGCT 
GCCATGTTTCAGGTGGGTATGTACAAGGTCCATCCGCCAATGCCCAGCTCTCTGTCGGCC 
GAGGCCCAAGCCTTTCTCCTCCGAACTTTTGAGCCAGACCCCCGCCTCCGAGCCAGCGCC 
CAGACACTGCTGGGGGACCCCTTCCTGCAGCCTGGGAAAAGGAGCCGCAGCCCCAGCTCC 
CCACGACATGCTCCACGGCCCTCAGATGCCCCTTCTGCCAGTCCCACTCCTTCAGCCAAC 
TCAACCACCCAGTCTCAGACATTCCCGTGCCCTCAGGCACCCTCTCAGCACCCACCCAGC 
CCCCCGAAGCGCTGCCTCAGTTATGGGGGCACCAGCCAGCTCCGGGTGCCCGAGGAGCCT 
GCGGCCGAGGAGCCTGCGTCTCCGGAGGAGAGTTCGGGGCTGAGCCTGCTGCACCAGGAG 
AGCAAGCGTCGGGCCATGCTGGCCGCAGTATTGGAGCAGGAGCTGCCAGCGCTGGCGGAG 
AATCTGCACCAGGAGCAGAAGCAAGAGCAGGGGGCCCGTCTGGGCAGAAACCATGTGGAA 
GAGCTGCTGCGCTGCCTCGGGGCACACATCCACACTCCCAACCGCCGGCAGCTCGCCCAG 
GAGCTGCGGGCGCTGCAAGGACGGCTGAGGGCCCAGGGCCTTGGGCCTGCGCTTCTGCAC 
AGACCGCTGTTTGCCTTCCCGGATGCGGTGAAGCAGATCCTCCGCAAGCGCCAGATCCGT 
CCACACTGGATGTTCGTTCTGGACTCACTGCTCAGCCGTGCTGTGCGGGCAGCCCTGGGT 
GTGCTAGGACCGGAGGTGGAGAAGGAGGCGGTCTCACCGAGGTCAGAGGAGCTGAGTAAT 
GAAGGGGACTCCCAGCAGAGCCCAGGCCAGCAGAGCCCGCTTCCGGTGGAGCCCGAGCAG 
GGCCCCGCTCCTCTGATGGTGCAGCTGAGCCTCTTGAGGGCAGAGACTGATCGGCTGCGC 
GAAATCCTGGCGGGGAAGGAACGGGAGTACCAGGCCCTGGTGCAGCGGGCTCTACAGCGG 
CTGAATGAGGAAGCCCGGACCTATGTCCTGGCCCCAGAGCCTCCAACTGCTCTTTCAACG 
GACCAGGGCCTGGTGCAGTGGCTACAGGAACTGAATGTGGATTCAGGCACCATCCAAATG 
CTGTTGAACCATAGCTTCACCCTCCACACTCTGCTCACCTATGCCACTCGAGATGACCTC 
ATCTACACCCGCATCAGGGGAGGGATGGTATGCCGCATCTGGAGGGCCATCTTGGCACAG 
CGAGCAGGATCCACACCAGTCACCTCTGGACCCTGA 

SEQ ID NO: 115_PAK6_H 

ATGTTTGGGAAGAAAAAGAAAAAGATTGAAATATCTGGCCCGTCCAACTTTGAACACAGG 
GTTCATACTGGGTTTGATCCACAAGAGCAGAAGTTTACCGGCCTTCCCCAGCAGTGGCAC 
AGCCTGTTAGCAGATACGGCCAACAGGCCAAAGCCTATGGTGGACCCTTCATGCATCACA 
CCCATCCAGCTGGCTCCTATGAAGACAATCGTTAGAGGAAACJ^AACCCTGCAAGGAAACC 
TCCATCAACGGCCTGCTAGAGGATTTTGACAACATCTCGGTGACTCGCTCCAACTCCCTA 
AGGAAAGAAAGCCCACCCACCCCAGATCAGGGAGCCTCCAGCCACGGTCCAGGCCACGCG 
GAAGAAAATGGCTTCATCACCTTCTCCCAGTATTCCAGCGAATCCGATACTACTGCTGAC 
TACACGACCGAAAAGTACAGGGAGAAGAGTCTCTATGGAGATGATCTGGATCCGTATTAT 
AGAGGCAGCCACGCAGCCAAGCAAAATGGGCACGTAATGAAAATGAAGCACGGGGAGGCC 
TACTATTCTGAGGTGAAGCCTTTGAAATCCGATTTTGCCAGATTTTCTGCCGATTATCAC 
TCACATTTGGACTCACTGAGCAAACCAAGTGAATACAGTGACCTCAAGTGGGAGTATCAG 
AGAGCCTCGAGTAGCTCCCCTCTGGATTATTCATTCCAATTCACACCTTCTAGAACTGCA 
GGGACCAGCGGGTGCTCCAAGGAGAGCCTGGCGTACAGTGAAAGTGAATGGGGACCCAGC 
CTGGATGACTATGACAGGAGGCCAAAGTCTTCGTACCTGAATCAGACAAGCCCTCAGCCC 
ACCATGCGGCAGAGGTCCAGGTCAGGCTCGGGACTCCAGGAACCGATGATGCCATTTGGA 
GCAAGTGCATTTAAAACCCATCCCCAAGGACACTCCTACAACTCCTACACCTACCCTCGC 
TTGTCCGAGCCCACAATGTGCATTCCAAAGGTGGATTACGATCGAGCACAGATGGTCCTC 
AGCCCTCCACTGTCAGGGTCTGACACCTACCCCAGGGGCCCTGCCAAACTACCTCAAAGT 
CAAAGCAAATCGGGCTATTCCTCAAGCAGTCACCAGTACCCGTCTGGGTACCACAAAGCC 
ACCTTGTACCATCACCCCTCCCTGCAGAGCAGTTCGCAGTACATCTCCACGGCTTCCTAC 
CTGAGCTCCCTCAGCCTCTCATCCAGCACCTACCCGCCGCCCAGCTGGGGCTCCTCCTCC 
GACCAGCAGCCCTCCAGGGTGTCCCATGAACAGTTTCGGGCGGCCCTGCAGCTGGTGGTC 
AGCCCAGGAGACCCCAGGGAATACTTGGCCAACTTTATCAAAATCGGGGAAGGCTCAACC 
GGCATCGTATGCATCGCCACCGAGAAACACACAGGGAAACAAGTTGCAGTGAAGAAAATG 
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GAACATTCAACATGTATTGTTCATTAAGCTAGCTTCCTAGTTCCGATTAGACTAAGGAGA 
CTAAGCCTAGAGAGTCAATGTTAGAACAGTGAAAAGAATTCTGTGTGTGTGTGTGTGTGT 
GTGTGTGTGTGCACAATAAATAGGAAATGTAGAAACCAAGCAAGAAGGCTTAGTAGCTCA 
GTCTTTAACAAGGGCTAGAAAAGAATGTAATCTGATATGGAAGGATAGCAGCTTCTAATT 
TTCAATCATCTGTTGATATACTGTGAAACTTATTTTATTAAATTAATATTTATTAAATGG 

SEQ ID NO: 117_AA098024_M 

CTGCAGGAGAAGCACCTGTTTCATGGGGATGTGGCTGCCAGGAACATCCTGATCCAAAGT 
GACCTGACTCCCAAACTTTGTCATCTGGGCCTGGCTTATGAAGTTCATGCCCATGGGGCC 
ATCTCCTCTGCTCGATCCAGCACCATCCCTCTCAAGTGGCTTGCTCCAGAAAGGCTTCTC 
CTGAGACCTGCAAGCATCAGGGGAGATATTTGGTCCTTTGGGATCCTGCTTTATGAGATG 
GTGACTCTAGGAGCACCACCATACCCTGAAGTCCCTCCCACCAGCATCCTACAATATCTT 
CAGAGAAAGAAAATCATGAAGAGACCCAGCAGCTGCTCACATGCCATGTACAACATCATG 
AAGTGCTGTTGGCGCTGGAGTGAGGACAGCCGCCCCTTACTTGTTCAGCTGCTCCAGCGC 
CTAGAAGCTGCTTCTAGATCTGCCGATGACAAGGCTGTGTTGCAAGTGCCAGAGTTGGTG 
GTGCCTGAACTGTATGCAGATGTGGCTGGCATCAGGGCAGAAAGCATTTCCTATAGCTTC 
AGTGTCCTTTGAAGATGGTCCTAGACAAATGACTATATATGGGTGGAATTAGTTCCTTCA 
AGAACAGAGAGAAGGAACTTTCTGTGGCCCACCAAGGGAGAAAAAAGGACATGGATCTTG 
CATCTTTCCCTAAACATTTTCCTAGACATCTGAAATGCTGCTGGATGAAGCTCTACCTCT 
ACATACCATGTACTCTTGAGCTAAGAATCACCATCAATTGTAGTTTGCTTTCCAGTCCCA 
AGGGCTGAAGTATAAGTGGTGGACCGTGTCATTCTAAAGGAGGTTTTTAAAATCTGCAAT 
GATTGTAAGGGAATTAGGCAAAAGGGCTGGTCCCACTCACTCCAGGCTGGTTTACTACTG 
AAACTAGTTTTTCTTTTCTTTTTTTTTAAGTTAAACTATTACAGAGTAAAAATAAACCAG 
ATGGGCATGAATGAACACCTTCTAATTTTTAACCATGAATTGAATATTGGAATTCATGAG 
AAAGAAAATTCTAGGTTCTTTTTGCTAAGAGGTGTTAAGGTGAGTCAATATATCCTTCAA 
GGAAAGGCTTTGTCTCATCTATGTTGACGGGACGTAAAAGTCCTCGTCCCGTTATGAAGA 
GCTGAAGAAGATCTATAAGAAACAATACTGAGCCTTTCCTTGACTATAGATAGAAGAGCA 
TCCTTTCATTGAACTCTGAGGCAGGTGGACCATGCATGATACTAAGTTTAATTAGAAGCA 
GGAGGAGTCATTTCATGATTAGGAACATTGTTCATCCCATTTGTTTGCCAGTTCCTGTAA 
GACTAAGGAGAATCAGCCTATAGAGCCAAAGCTAGAACCAGGGATAAAAAGTGTGTGTGT 
GTATAACAAATAGGAAGCATGAAAGTCGAGCAAGAAGACTTAGTAACCCAGGTGGTCATT 
AAGAGGTACAGAGAAGAAGTAATCTTATAGGAATGGATGGTAGCTTCTAATTTTTAACCA 
TTCATTGAAATAACTGTGAAGCAACTCATTAAACTAGTATTTATTGACCAAAAGTAGACT 
TTTCAGGTGTATAGCTGCCAAAATCTCTATAATAAAGAGGCTAAAAGAAAATAAATGGGA 
GTTATTTTACTAGGAAAATTAGAGAACCTATAGTTTCCAAAAAGAGATTCTTTATGTGCA 
AAATGAGATAACTCTCTACCTCACAGGGTTGGTGTGAGGAACAATGAGAATATGTATTTG 
TGTATTATGTAGAATATAATATATTCTCAATAAATACTAGTTTTTCCCCTTTC 

SEQ ID NO: 118_SGK2ALPHA_H 

GAAGAGGGCAGAGCCGTGCATGGGGCTGCTCCCCAGGACCTGAGCAGGAACCTGGAGTTT 
TCAGAGCTGCCTGATCATTGCTACAGAATGAACTCTAGCCCAGCTGGGACCCCAAGTCCA 
CAGCCCTCCAGGGCCAATGGGAACATCAACCTGGGGCCTTCAGCCAACCCAAATGCCCAG 
CCCACGGACTTCGACTTCCTCAAAGTCATCGGCAAAGGGAACTACGGGAAGGTCCTACTG 
GCCAAGCGCAAGTCTGATGGGGCGTTCTATGCAGTGAAGGTACTACAGAAAAAGTCCATC 
TTAAAGAAGAAAGAGCAGAGCCACATCATGGCAGAGCGCAGTGTGCTTCTGAAGAACGTG 
CGGCACCCCTTCCTCGTGGGCCTGCGCTACTCCTTCCAGACACCTGAGAAGCTCTACTTC 
GTGCTCGACTATGTCAACGGGGGAGAGCTCTTCTTCCACCTGCAGCGGGAGCGCCGGTTC 
CTGGAGCCCCGGGCCAGGTTCTACGCTGCTGAGGTGGCCAGCGCCATTGGCTACCTGCAC 
TCCCTCAACATCATTTACAGGGATCTGAAACCAGAGAACATTCTCTTGGACTGCCAGGGA 
CACGTGGTGCTGACGGATTTTGGCCTCTGCAAGGAAGGTGTAGAGCCTGAAGACACCACA 
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FIGURE 2LLLL 

TCCACATTCTGTGGTACCCCTGAGTACTTGGCACCTGAAGTGCTTCGGAAAGAGCCTTAT 
GATCGAGCAGTGGACTGGTGGTGCTTGGGGGCAGTCCTCTACGAGATGCTCCATGGCCTG 
CCGCCCTTCTACAGCCAAGATGTATCCCAGATGTATGAGAACATTCTGCACCAGCCGCTA 
CAGATCCCCGGAGGCCGGACAGTGGCCGCCTGTGACCTCCTGCAAAGCCTTCTCCACAAG 
GACCAGAGGCAGCGGCTGGGCTCCAAAGCAGACTTTCTTGAGATTAAGAACCATGTATTC 
TTCAGCCCCATAAACTGGGATGACCTGTACCACAAGAGGCTAACTCCACCCTTCAACCCA 
AATGTGACAGGACCTGCTGACTTG7^AGCATTTTGACCCAGAGTTCACCCAGG7^AGCTGTG 
TCCAAGTCCATTGGCTGTACCCCTGACACTGTGGCCAGCAGCTCTGGGGCCTCAAGTGCA 
TTCCTGGGATTTTCTTATGCGCCAGAGGATGATGACATCTTGGATTGCTAGAAGAGAAGG 
ACCTGTGAAACTACTGAGGCCAGCTGGTATTAGTAAGGAATTACCTTCAGCTGCTAGGAA 
GAGCGACTCAAACTAACAATGGCTTCAACGAGAAGCAGGTTTATTTTTTCCAGCACATAA 
AAGAAAAATAATGTTTCGGAGTCCAGGACTGGCAGGACAGGTCATCAGATACTCAGAGGC 
TGTATCTCTGCCCTGCCAACCTTGACAAATGGCTTCCAATGTTAGGTTTGCTACAAGATG 
GTTACTGGAGCTCTAGCTGCCTATTTTGTGTTTAGGGAAGGGAAAATGGAGGT^AAGGGGA 
GAAGAGCAAAGGGCGCTTTTAAAGAGCTTTCCCAAAAGCTCCCCCCAATGACTTTTGCTT 
CCATCTCACTAACCACCCACCCCTACCTGGAATGGAGGCTGGGAAATGTGGCTTATTTGC 
TGGGTACGTGACTATCCCTAATAACAAAGGGGTTTTGACCCTAAGACATTAGGGGAGAAT 
GTTGGGTAGGCAGCCAGCCCTCTTTTACCATAGGGCCTCCTGGTGTTTGGATTTTGATCT 
CAATGTGTAAAATGACAGAGATGTAACAAGCTCATAGGGTATCAATATCTCTTATTGTTC 
TATGTTGAAAAA 

SEQ ID NO: 12 0_CCRK_H 

ATGGACCAGTACTGCATCCTGGGCCGCATCGGGGAGGGCGCCCACGGCATCGTCTTCAAG 
GCCAAGCACGTGGAGACTGGCGAGATAATTGCCCTCAAGAAGGTGGCCCTAAGGCGGTTG 
GAAGACGGCTTCCCTAACCAGGCCCTGCGGGAGATTAAGGCTCTGCAGGAGATGGAGGAC 
AATCAGTATGTGGTACAACTGAAGGCTGTGTTCCCACACGGTGGAGGCTTTGTGCTGGCC 
TTTGAGTTCATGCTGTCGGATCTGGCCGAGGTGGTGCGCCATGCCCAGAGGCCACTAGCC 
CAGGCACAGGTCAAGAGCTACCTGCAGATGCTGCTCAAGGGTGTCGCCTTCTGCCATGCC 
AACAACATTGTACATCGGGACCTGAAACCTGCCAACCTGCTCATCAGCGCCTCAGGCCAG 
CTCAAGATAGCGGACTTTGGCCTGGCTCGAGTCTTTTCCCCAGACGGCAGCCGCCTCTAC 
ACACACCAGGTGGCCACCAGGTCTGTGGGCTGCATCATGGGGGAGCTGTTGAATGGGTCC 
CCCCTTTTCCCGGGCAAGAACGATATTGAACAGCTTTGCTATGTGCTTCGCATCTTGGGC 
ACCCCT^AACCCTCAAGTCTGGCCGGAGCTCACTGAGCTGCCGGACTACAACAAGATCTCC 
TTTAAGGAGCAGGTGCCCATGCCCCTGGAGGAGGTGCTGCCTGACGTCTCTCCCCAGGCA 
TTGGATCTGCTGGGTCAATTCCTTCTCTACCCTCCTCACCAGCGCATCGCAGCTTCCAAG 
GCTCTCCTCCATCAGTACTTCTTCACAGCTCCCCTGCCTGCCCATCCATCTGAGCTGCCG 
ATTCCTCAGCGTCTAGGGGGACCTGCCCCCAAGGCCCATCCAGGGCCCCCCCACATCCAT 
GACTTCCACGTGGACCGGCCTCTTGAGGGAGTCGCTGTTGAACCCAGAGCTGATTCGGCC 
CTTCATCCTGGAGGGGTGAGAAGTTGGCCCTGGTCCCGTCTGCCTGCTCCTCAGGACCAC 
TCAGTCCACCTGTTCCTCTGCCACCTGCCTGGCTTCACCCTCCAAGGCCTCCCCATGGCC 
ACAGTGGGCCCACACCACACCTTGCCCCTTAGCCCTTGCGAGGGTTGGTCTCGAGGCAGA 
GGTCATGTTCCCAGCCAAGAGTATGAGAACATCCAGTCGAGCAGAGGAGATTCATGGCCT 
GTGCTCGGTGAGCCTTACCTTCTGTGTGCTACTGACGTACCCATCAGGACAGTGAGCTCT 
GCTGCCAGTCAAGGCCTGCATATGCAGAATGACGATGCCTGCCTTGGTGCTGCTTCCCCC 
GAGTGCTGCCTCCTGGTCAAGGAGAAGTGCAGAGAGTAA 

SEQ ID NO: 121_TESK2_H 

GAATTCGCGGCCGCTCGACGCTCAGCAGAGCTACCAGCTGCCCTGTTGGCTTCGCTGGTC 
GGATCGTCCTCCTGGCCCCGCCAAACAGGCGAGCGGCCCCGACTGTGGGGCATGGCAGTA 
GTCTCCTCGTTCTCCGCCGCCGCTAGCCTAGCTGAGTCGCCGGCTTCTGCGCTAGGGGCT 
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CCCACCGCCTCCGCAGGCTAAGGAGCCGCTGCCACCAACGAGCTGTGAGGGTTACTATGC 

TCCCTCTTTGCCGCCGTCTCCTCCTCTTGCCCGCGCAGGCACCCCTCTGGCTGCTCAGTC 

CTGCCTCAGTGTCAAACCAGAAGAGAAGTAAAATTCAACAAAAATTTATGTGTGGAGTTC 

CTTCTTAAAAGAAGAAAAAAGTGATTATTTAGACTATGGATCGGAGCAAACGGAATTCAA 

TTGCAGGATTTCCTCCACGTGTGGAGCGTCTTGAAGAGTTTGAAGGAGGTGGTGGAGGAG 

AAGGAAATGTGAGCCAGGTGGGAAGAGTTTGGCCATCTTCGTATCGAGCTCTTATAAGTG 

CCTTTTCCAGACTGACGCGTTTGGATGATTTCACCTGTGAAAAAATAGGGTCTGGCTTCT 

TTTCTGAAGTGTTCAAGGTACGACACCGAGCTTCTGGTCAGGTGATGGCTCTTAAGATGA 

ACACATTGAGCAGTAACCGGGCAAACATGCTGAAAGAAGTACAGCTCATGAATAGACTCT 

CCCATCCCAACATCCTTAGGTATATCAACTCCGGGAACCTGGAACAGTTGCTAGACAGTA 

ACCTGCATTTGCCTTGGACTGTGAGGGTAAAACTGGCCTATGACATAGCAGTGGGCCTCA 

GCTACCTTCACTTCAAAGGCATTTTTCATCGGGACCTCACATCTAAGAACTGCCTGATAA 

AGAGGGATGAGAATGGTTACTCTGCAGTGGTAGCTGACTTTGGCCTGGCTGAGAAGATCC 

CCGATGTCAGCATGGGGAGTGAGAAGCTGGCCGTGGTGGGTTCCCCATTCTGGATGGCAC 

CTGAGGTTCTCCGAGATGAGCCCTATAATGAAAAGGCAGATGTGTTCTCTTATGGTATCA 

TCCTCTGCGAGATCATCGCCCGCATCCAGGCCGATCCGGACTATCTTCCCCGCACAGAGA 

ATTTCGGGCTGGACTATGATGCTTTCCAGCACATGGTGGGAGACTGTCCCCCAGATTTTC 

TGCAACTTACTTTCAACTGCTGTAACATGGATCCCAAACTGCGCCCATCTTTTGTGGAGA 

TTGGGAAGACCCTGGAGGAAATTCTGAGCCGCCTACAGGAAGAAGAGCAGGAGAGGGATA 

GGAAGCTGCAGCCCACAGCCAGGGGACTCTTGGAGAAAGCACCTGGGGTGAAGCGACTAA 

GCTCACTGGATGACAAGATCCCCCACAAGTCACCATGCCCAAGACGTACCATCTGGCTGT 

CTCGAAGCCAGTCAGATATCTTTTCCCGTAAGCCCCCACGTACAGTGAGTGTCTTGGACC 

CATACTACCGGCCACGAGATGGTGCTGCCCGCACCCCCAAAGTCAACCCTTTTAGTGCTC 

GCCAGGACCTCATGGGGGGCAAGATCAAGTTTTTTGACCTGCCCAGCAAGTCTGTCATCT 

CTCTGGTATTTGACCTGGATGCACCAGGGCCCGGAACTATGCCCCTGGCTGACTGGCAGG 

AGCCCCTGGCCCCACCTATTCGCCGGTGGCGTTCCTTGCCTGGTTCGCCTGAGTTCTTGC 

ATCAAGAGGCTTGTCCATTTGTGGGCCGGGAAGAATCGCTATCTGATGGGCCCCCACCAC 

GCCTAAGTAGTCTCAAGTACAGAGTTAAAGAGATCCCACCATTCCGGGCATCTGCCCTAC 

CAGCTGCTCAAGCCCATGAGGCTATGGACTGCTCCATTCTCCAGGAAGAAAATGGTTTTG 

GGTCCAGGCCCCAGGGGACCAGTCCATGCCCTGCGGGTGCTTCTGAGGAGATGGAGGTAG 

AAGAAAGGCCAGCAGGCTCAACTCCAGCCACCTTCTCCACCTCAGGCATAGGCCTGCAAA 

CCCAGGGAAAGCAGGATGGGTGAGGGGGTTTAGTCCCTGCCTCACCTTGGGGATGGACCT 

TCAGCTGAAACCATATGGCCCCCTAGGTGCACAGCCTTGATTCTTCCCTGGAGCCTACAG 

AGCAGGCAGGCTAGGCCAAGCCAGGCTCAACTTCTGGGCTCCCAGTGCCCATTGGCTGTG 

TATGACGGGAGGCAGCAGTGAGAGGCCTTCCTAGTTAGGGCCAACAGCTGATACCAAGCC 

TCTGAAATCCAGCAAGGAGGTCTGCCTCCCACCAGACCCTCTCCAGTGTACTTCCCCAGA 

TAGGACCAGAGGATGTCTAGTTCTAGGCTGAGCTGGCAGGCAGCTATTACCCCGGTTCTT 

TCCCCACCCCAGGTCTGTCTCTTGCCTTTTCTTGGGGCATATAAGCTACTGAGTGGAACA 

TGGAGCTGATCAAGAGGCCGTAATGGTCATGGCTGTTTCCAGACCTGAATATTGGGTGCT 

TCTTGCCAGTATTCTAAGACATTTGAGTAATTGCTGTTTGCACTTACTGCATGGTCAGAC 

CACGTCACTACATTTCTATGCAAGGGGACAGCAAGGCAGCGTGGTGGTCATGGCTCTTAG 

CTAACCTATTCAAAGACCTTTTCCTGTTGATTAATCTATTTTCATATTTATAAAGGAGTC 

TTAATGTTCTGCCCCATAAGACTTTCAACCTTGTGGTTGGGAGTGGGGCTGGTTTTGTAG 

GCCCTAGGGCCTGCTTCTATGTATTTATCAACATGTGATACATTCAATTGGTTAAATGGT 

TTATACAGGGACTGATTTGCTTCCCTTCCTGCCATGGCTGGAGCTTTGGGAACAGTCTGT 

CCTTACAGAGCTGCAATAAGAAATAACCAAAGATGAAGCTGGTCAAATATTTTCATAACT 

TGCTTCTGTTGATTTTTTTTTTGTAAAACTTTCCCAAGACATTTTCAGACTTAAAAATAA 

AGTCAGTGTTACAGGT 
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23 October 1998 (1998-10-23) 

STRAUSBERG R.: "Homo sapiens cDNA clone 

IMAGE: 1884219." 

XP002165850 

abstract 

DOC. AGAINST INV. 14 (SEQ.IDs. 135, 17) 


6,7 


X 


WO 98 11234 A (HAWKINS PHILLIP R ; INCYTE 
PHARMA INC (US); AU YOUNG JANICE (US); G) 
19 March 1998 (1998-03-19) 
[ SEQ.IDs. 5 and 6 

DOC. AGAINST INV. 15 (SEQ.IDs. 136, 18) 
DOC. AGAINST INV. 16 (SEQ.IDs. 137, 19) 


1-15, 

26-30, 

35-38 


X 


EP 0 861 896 A (DADE BEHRING MARBURG GMBH) 
2 September 1998 (1998-09-02) 

SEQ.IDs. 1 and 2 

DOC. AGAINST INV. 15 (SEQ.IDs. 136, 18) 
DOC. AGAINST INV. 16 (SEQ.IDs. 137, 19) 


1-15, 

26-28, 

35-38 


P.X 


DATABASE EMBL 'Online! 

Accession Number AF205855, 

23 December 1999 (1999-12-23) 

SHIGAEV A. ET AL.: "Mus musculus serum and 

glucocorticoid-dependent protein kinase 

(Sgk) mRNA." 

XP002165851 

abstract 

DOC. AGAINST INV. 16 (SEQ.IDs. 137, 19) 

-/-- 


1,2,4,6, 
7,10-13, 
15 
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WO 00 35946 A (UNIV DUNDEE ;C0HEN PHILIP 
(6B); DEAK MARIA (GB); KOBAYASHI TAKAYAS) 

22 June 2000 (2000-06-22) 
figure 11 

DOC. AGAINST INV. 17 (SEQ.IDs. 138, 20) 
DOC. AGAINST INV. 113 (SEQ.IDs. 234, 116) 

DATABASE EMBL 'Online! 
Accession Number Z98752, 

23 August 1997 (1997-08-23) 

RAMSAY H. : "Human DNA sequence from clone 
RP1-138B7 on chromosome 20ql3.12." 
XP002165852 
abstract 

DOC. AGAINST INV. 17 (SEQ.IDs. 138, 20) 

WO 98 31802 A (GENETICS INST) 
23 July 1998 (1998-07-23) 
SEQ.IDs. 13 and 14 

DOC. AGAINST INV. 21 (SEQ.IDs. 142, 24) 

DATABASE EMBL 'Online! 

Accession Number AI061003, 

23 July 1998 (1998-07-23) 

MARRA M. ET AL.: "Mus musculus cDNA clone 

IMAGE: 1379597 5' similar to TR:008875 

calcium calmodulin dependent kinase 

CPG16. " 

XP002165853 

abstract 

DOC. AGAINST INV. 22 (SEQ.IDs. 143, 25) 

DATABASE EMBL 'Online! 

Accession Number AA383293, 

18 April 1997 (1997-04-18) 

ADAMS M. D. ET AL.: "Homo sapiens cDNA 

5'-end similar to serine/threonine kinase 

p78." 

XP002165854 
abstract 

DOC. AGAINST INV. 23 (SEQ.IDs. 144, 26) 

DATABASE EMBL 'Online! 

Accession number AA197883, 

29 January 1997 (1997-01-29) 

MARRA M. ET AL.: "Mus musculus cDNA clone 

IMAGE:654045 5' similar to TR:G406113 

protein kinase I . " 

XP002165855 

abstract 

DOC. AGAINST INV. 24 (SEQ.IDs. 145, 27) 

-/- 



1-14,26, 
27,29,30 



1-14 



6,7,11, 
12 



1,2,4,6, 
7,10-13, 
25 



1,2,4,6, 
7,10-13, 
16 



1,2,4,6, 
7,10-13 
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,x 



WO 99 50395 A (HELIX RESEARCH INST 
;MURAMATSU MASAAKI (JP); TOKUMITSU HIROSHI 
(JP) 7 October 1999 (1999-10-07) 
page 41 -page 51 

DOC. AGAINST INV. 24 (SEO.IDs. 145, 27) 

SANJ0 H. ET AL: "DRAKs, novel 
serine/threonine kinases related to 
death-associated protein kinase that 
trigger apoptosis ." 
JOURNAL OF BIOLOGICAL CHEMISTRY, 
vol. 273, no. 44, 

30 October 1998 (1998-10-30), pages 
29066-29071, XP002165841 
ISSN: 0021-9258 
figure 1 

DOC. AGAINST INV. 26 (SEQ.IDs. 147, 29) 

WO 99 33961 A (AKIRA SHIZUO ;KAWAI TARO 
(JP); ASAH1 CHEMICAL IND (JP)) 

8 July 1999 (1999-07-08) 

SEO.IDs. 5 and 6 

DOC. AGAINST INV. 26 (SEQ.IDs. 147, 29) 
SEQ.IDs. 29 and 30 

DOC. AGAINST INV. 40 (SEQ.IDs. 161, 43) 

NAGASE T. ET AL. : "PREDICTION OF THE 
CODING SEQUENCE OF UNIDENTIFIED HUMAN 
GENES. XIII. THE COMPLETE SEQUENCE OF 100 
NEW CDNA CLONES FROM BRAIN WHICH CODE FOR 
LARGE PROTEINS IN VITRO" 
DNA RESEARCH, 

vol. 6, 26 February 1999 (1999-02-26), 

pages 63-70, XP000952912 

ISSN: 1340-2838 

the whole document 

-& DATABASE EMBL 'Online! 

Accession Number AB023153, 

9 April 1999 (1999-04-09) 

NAGASE T. ET AL. : "Homo sapiens mRNA for 

KIAA0936 protein." 

XP002166242 

abstract 

DOC. AGAINST INV. 54 (SEQ.IDs. 175, 57) 

-& DATABASE EMBL 'Online! 

Accession Number AB023216, 

9 April 1999 (1999-04-09) 

NAGASE T. ET AL.: "Homo sapiens mRNA for 

KIAA0999 protein." 

XP002166243 

abstract 

DOC. AGAINST INV. 28 (SEQ.IDs. 149, 31) 

-/-- 



1-4,6-13 



1-14,16, 
35,37 



1-14,16, 
26,27, 
29,30, 
35,37 



1,2,4-7, 

10-14, 

16,19 
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WO 98 01756 A (UNIV WASHINGTON) 
15 January 1998 (1998-01-15) 



SEQ.IDs. 1 and 2 
DOC . AGAINST 1NV. 



30 (SEQ.IDs. 151, 33) 



DATABASE EMBL 'Online! 

Accession Number W90839, 

9 July 1996 (1996-07-09) 

MARRA M. ET AL.: "Mus musculus cDNA clone 

IMAGE: 420441 5' similar to gb:M80359 

putative serine/threonine protein kinase." 

XP002165856 

abstract 

DOC. AGAINST INV. 31 (SEQ.IDs. 152, 34) 

NAGASE T. ET AL.: "PREDICTION OF THE 
CODING SEQUENCES OF UNIDENTIFIED HUMAN 
GENES. IV. THE CODING SEQUENCES OF 40 NEW 
GENES (KIAA0121-KIAA0160) DEDUCED BY 
ANALYSIS OF CDNA CLONES FROM HUMAN CELL 
LINE KG-1" 
DNA RESEARCH, 
vol. 2, no. 4, 

31 August 1995 (1995-08-31), pages 

167-174, XP000676653 

ISSN: 1340-2838 

the whole document 

-& DATABASE EMBL 'Online! 

Accession Number D50925, 

1 August 1996 (1996-08-01) 

NAGASE T. ET AL.: "Human mRNA for KIAA0135 

gene. " 

XP002166244 

abstract 

DOC. AGAINST INV. 32 (SEQ.IDs. 153, 35) 
DOC. AGAINST INV. 33 (SEQ.IDs. 154, 36) 

DATABASE EMBL 'Online! 
Accession Number U79240, 
14 December 1996 (1996-12-14) 
YU W. ET AL.: "Human serine/threonine 
kinase mRNA." 
XP002165857 
abstract 
DOC. AGAINST 



INV. 33 (SEQ.IDs. 154, 36) 
-/- 



1-14,16, 

26,27, 

35,36 



1,2,4,6, 
7,10-13, 
16 



1,2,4-7, 
10-14,16 



1,2,4-7, 
10-14 
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X 


DATABASE EMBL 'Online! 


1,2,4,6, 




Accession Number AI036899, 


7,10-13 




29 June 1998 (1998-06-29) 






MARRA M. ET AL.: "Mus musculus cDNA clone 






InAbL : 1 /4ol)l l t> similar to lK:uyy/oo 






serine/threonine protein kinase." 






XP002165858 






abstract 






DOC. AGAINST INV. 33 (SEQ.IDs. 154, 36) 




X 


— 

DATABASE EMBL 'Online! 


6,7 




Accession Number AI469033, 






17 March 1999 (1999-03-17) 






STRAUSBERG K. : Homo sapiens cdna clone 






IMAGE : 2137322. " 






XP002165859 






abstract 






DOC. AGAINST INV. 35 (SEQ.IDs. 156, 38) 




X 


— 

DATABASE EMBL 'Online! 


6,7 




Accession Number AC007225, 




7 April 1999 (1999-04-07) 






BRUCE D. ET AL. : Homo sapiens chromosome 






16 clone RPCI-11.480G7. " 






XP002166245 






abstract 






DOC. AGAINST INV. 37 (SEQ.IDs. 158, 40) 




P,X 




WO 99 49062 A (FAN WUFANG ;GENE LOGIC INC 


1-14, 


(US;; PRAbHAR YAIINDRA (US); 


Zo-30 , 




30 September 1999 (1999-09-30) 


35-38 




SEQ.IDs. 1 and 2 






DOC. AGAINST INV. 37 (SEQ.IDs. 158, 40) 






DOC. AGAINST INV. 55 (SEQ.IDs. 176, 58) 




X 


— 

DATABASE EMBL 'Online! 


1,2,4,6, 




Accession Number AI596766, 


7,10-13, 




26 April 1999 (1999-04-26) 


16 




MARRA M. ET AL.: "Mus musculus cDNA clone 






1MAGE:949322 5' similar to WP:ZC373.4 






CE02377 myosi n-1 i ght-chai n kinase domain." 






XP002166246 






abstract 






DOC. AGAINST INV. 38 (SEQ.IDs. 159, 41) 






DOC. AGAINST INV. 39 (SEQ.IDs. 160, 42) 






-/-- 
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P,X 


NAGASE T. ET AL. : "PREDICTION OF THE 


1,2,4-7, 




CODING SEQUENCES OF UNIDENTIFIED HUMAN 


10-14, 




GENES. XVI. THE COMPLETE SEOUENCES OF 150 


16,25 




NEW CDNA CLONES FROM BRAIN WHICH CODE FOR 




LARGE PROTEINS IN VITRO" 






DNA RESEARCH, 






vol. 7, 28 February 2000 (2000-02-28), 






pages 65-73, XP000923011 






KIAA1297 






-& DATABASE EMBL 'Online! 






Accession Number AB037718, 






14 March 2000 (2000-03-14) 






0HARA 0. ET AL.: "Homo sapiens mRNA for 






KIAA1297 protein." 






XP002166247 






abstract 






DOC. AGAINST INV. 38 (SEQ.IDs. 159, 41) 






DOC. AGAINST INV. 39 (SEQ.IDs. 160, 42) 






-& DATABASE EMBL 'Online! 






Accession Number AB037759, 






14 March 2000 (2000-03-14) 






OHARA 0. ET AL.: "Homo sapiens mRNA for 






KIAA1338 protein." 






XP002167889 






abstract 






DOC. AGAINST INV. 67 (SEQ.IDs. 188, 70) 






-& DATABASE EMBL 'Online! 






Accession Number AB037790, 






14 March 2000 (2000-03-14) 






OHARA 0. ET AL.: "Homo sapiens mRNA for 






KIAA1369 protein." 






XP002167890 






abstract 






DOC. AGAINST INV. 68 (SEQ.IDs. 189, 71) 






-& DATABASE EMBL 'Online! 






Accession Number AB037781, 






14 March 2000 (2000-03-14) 






NAGASE T. ET AL.: Homo sapiens mRNA for 






KIAA1360 protein/partial cds." 






XP002168226 






abstract 






DOC. AGAINST INV. 82 (SEQ.IDs. 203, 85) 




X 


— 

KAWAI T. ET AL.: "Duet is a novel 


1-14,29, 




serine/threonine kinase with Dbl-Homology 


30,35-38 




(DH) and Pleckstrin-Homology (PH) domains" 






GENE, 






vol . 227, no. 2, 






to r~_L._~....~ u -.i. i r\c\r\ t i r\t\t\ o o i o \ ______ 

18 February 1999 (1999-02-18), pages 






249-255, XP004158739 






ISSN: 0378-1119 






the whole document 






DOC. AGAINST INV. 40 (SEQ.IDs. 161, 43) 






-/~ 
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DATABASE EMBL 'Online! 

Accession Number AA454060, 

11 June 1997 (1997-06-11) 

HILLIER L. ET AL.: "Homo sapiens cDNA 

clone IMAGE: 795492 5' similar to TR:G49075 

calmodul in-binding protein." 

XP002166248 

abstract 

DOC. AGAINST INV. 41 (SEQ.IDs. 162, 44) 

DATABASE EMBL 'Online! 

Accession Number AI385971, 

29 January 1999 (1999-01-29) 

MARRA M. ET AL.: "Mus musculus cDNA clone 

IMAGE:514336 5' similar to WP:R90.1 

CE06325 protein kinase." 

XP002166249 

abstract 

DOC. AGAINST INV. 43 (SEQ.IDs. 164, 46) 

DATABASE EMBL 'Online! 

Accession Number AA436054, 

1 June 1997 (1997-06-01) 

HILLIER L. ET AL.: "Homo sapiens cDNA 

clone IMAGE:730582 5' similar to gb:X66363 

serine/threonine-protein kinase pctaire-1" 

XP002166250 

abstract 

DOC. AGAINST INV. 44 (SEQ.IDs. 165, 47) 

DATABASE EMBL 'Online! 

Accession Number AA061797, 

24 September 1996 (1996-09-24) 

MARRA M. ET AL.: "Mus musculus cDNA clone 

1MAGE:513953 5' similar to gb:X66358 

serine/threonine-protein kinase. " 

XP002166251 

abstract 

DOC. AGAINST INV. 45 (SEQ.IDs. 166, 48) 
DOC. AGAINST INV. 46 (SEQ.IDs. 167, 49) 

WO 98 35015 A (GERHOLD DAVID L ; MERCK & CO 
INC (US)) 13 August 1998 (1998-08-13) 
SEQ.IDs. 2 and 3 

DOC. AGAINST INV. 50 (SEQ.IDs. 171, 53) 
DOC. AGAINST INV. 114 (SEQ.IDs. 235, 117) 

-/-- 



6,7 



1,2,4,6, 
7,10-13, 
16 



1,2,4-7, 
10-14,16 



1,2,4,6, 
7,10-13, 
17 



1-30, 
35-38 
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INAYLtK U. tl ML.. ItlaidC ten Za L 1 On auQ 


A 4, 




comparison ot toui serine- ana 


O I o , AO , 




argi m ne-r lcn \*>i\) protein Kinases. 


9fi 97 
£0 , CI 




BIOCHEMICAL JOURNAL , 






vol. 326, no. 3, 1997, pages 693-700, 






VDAfVO 1 A AO/I n 












xne wnoie oocument 






r\nr ATATMCT TMU El ( Cm THc 179 CVl > 
UUL. AbAlNbl llMV. bl votu.lUS. b4 ; 




D V 

P,X 


WU yy Joyol A vArxtKDLUri INbKlU t , INLY It 


1 — 1 d 1Q 


PHARMA INC (US); CORLEY NEIL C (US); BAN) 


26-30, 




5 August 1999 (1999-08-05) 


35-38 




btu. lus . 6 ana y 






nnr atatmct tmw ci /cm THr 1 70 ca\ 
UUL. AbAlNbl 1IMV. bl (otU.lUS. l/c, b4; 






stu.lDs. 0 ana 1l 






nnr atatmct tm\/ qc /cm THr 01 1 

DOL . AbAINbl 1NV. yb (btU.IUS. £lo, 4J; 




v 

X 


UAiABAbt triDL uniine: 


1 9 A f\ 




Accession Number AAojyyqu, 


7 i n— i 9 




z March iyyo ( lyyo-u^-u*: ; 


1 Q 




MARRA M. ET AL.: "Mus musculus cDNA clone 






IMAGE: 1259911 5' similar to SW:KMLC_RAT 






P20689 myosin light chain kinase." 






XP002167891 






abstract 






nnr atatmct tm\/ cc /cm mr 1 7a cq\ 
UUL . AbAlNbl 1NV. bb (otU.lUS. 1/0, bo; 




v 

X 


HATAD ACE CMDI ' flnl i 1 

UAIABAbt trlbL Un 11 ne ! 


1 9 A—l 
1 ,t ,4-/ , 




Accession Number AL031055, 


1U Is , lo 




10 July 1998 (1998-07-10) 






RAMSAY H.: "Human DNA sequence from clone 






RP1-Z8HZU on chromosome ^uqij.i. 






XrUUZlo/oyZ 






aostract 






nnr apatmct tm\/ rrr /cm Thf 177 cn\ 
DUG. AbAlNbl 1NV. bo (btU.lUS. 1//, by; 




D V 

P,X 


nATADACC CMDI 'nnl-in&l 

ua I AbAbt triDL un 1 1 ne * 


9 7 
£,0, / , 


Accession Number AL137662, 


11,12 




27 January 2000 (2000-01-27) 






K0EHRER K. ET AL.: "Homo sapiens mRNA" 






XP002167893 






abstract 






DOC . AGAINST INV. 57 (SEO.IDs. 178, 60) 






-/-- 
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X 


NAGASE T. ET AL.: "Prediction of the 


2,6,7, 




coding sequences of unidentified human 


11,12 




genes. VII. The complete sequences of 100 






new cDNA clones from brain which can code 






for large proteins in vitro" 






DNA RESEARCH, 






vol. 4, no. 4, 28 April 1997 (1997-04-28), 






pages 141-150, XP002102085 






ISSN: 1340-2838 






the whole document 






-& DATABASE EMBL 'Online! 






Accession Number AB002342, 






1 July 1997 (1997-07-01) 






NAGASE T. ET AL.: Human mRNA for KIAA0344 






gene. " 






XP002167894 






abstract 






DOC . AGAINST INV. 59 (SEQ.IDs. 180, 62) 




X 


WO 98 36054 A (HOOPER JOHN DAVID ;AMRAD 


1-14, 




OPERATIONS PTY LTD (AU); ANTALIS T0NI MAR) 


26-30, 




20 August 1998 (1998-08-20) 


35-38 




SEQ.IDs. 9 and 10 






DOC. AGAINST INV. 60 (SEQ.IDs. 181, 63) 




X 


ANDERSON K. A. ET AL. : "Components of a 


1-14,23, 




calmodul in-dependent protein kinase 


26,27 




cascade. Molecular cloning, functional 






characterization and cellular localization 






of Ca2+/calmodul in-dependent protein 






kinase kinase beta." 






JOURNAL OF BIOLOGICAL CHEMISTRY, 






vol . 273, no. 48, 






27 November 1998 (1998-11-27), pages 






31880-31889, XP002167887 






ISSN: 0021-9258 






the whole document 






-& DATABASE EMBL 'Online! 






Accession Number AF140507, 






21 May 1999 (1999-05-21) 






ANDERSON K . A. ET AL.: "Homo sapiens 






Ca2+/calmodulin-dependent protein kinase 






beta, complete cds." 






XP002167895 






abstract 






DOC. AGAINST INV. 62 (SEQ.IDs. 183, 65) 




P,X 


W0 99 58558 A (INCYTE PHARMA INC 


2,6,7,9, 


; PATTERSON CHANDRA (US); YUE HENRY (US); 


11,12, 




BANDMAN) 18 November 1999 (1999-11-18) 


26-28 




SEQ.IDs. 2 and 15 






DOC. AGAINST INV. 62 (SEQ.IDs. 183, 65) 






-/~ 
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Accession Number AC004685, 

15 May 1998 (1998-05-15) 

ADAMS M. D. AND LOFTUS B. J. : "Homo 

sapiens chromosome 16 BAC clone 

CIT987SK-A-233A8." 

XP002167896 

abstract 

DOC. AGAINST INV. 63 (SEQ.IDs. 184, 66) 


1,2,4-7 


P.X 


US 6 013 455 A (AZIMZAI YALDA ET AL) 
11 January 2000 (2000-01-11) 

SEQ.IDs. 2 and 11 

DOC. AGAINST INV. 63 (SEQ.IDs. 184, 66) 


1-14, 

26-30, 

35-38 


X 


EP 0 870 825 A (SMITHKLINE BEECHAM CORP) 
14 October 1998 (1998-10-14) 

figure 2 

DOC. AGAINST INV. 65 (SEQ.IDs. 186, 68) 
DOC. AGAINST INV. 66 (SEQ.IDs. 187, 69) 


1-14, 

25-30, 

35-38 


X 


DATABASE EMBL 'Online! 

Accession Number AA589241, 

18 September 1997 (1997-09-18) 

MARRA M. ET AL.: "Mus musculus cDNA clone 

IMAGE:992145 5' similar to WP:F49E11.1 

CE05897 serine/threonine protein kinase." 

XP002167897 

abstract 

DOC. AGAINST INV. 66 (SEQ.IDs. 187, 69) 


1,2,4,6, 
7,10-13, 
25 


P,X 


BERLANGA J. J. ET AL.: "Characterization 
of a mammalian homolog of the GCN2 
eukaryotic initiation factor 2alpha 
Kinase 11 

EUROPEAN JOURNAL OF BIOCHEMISTRY, 

vol. 265, no. 2, October 1999 (1999-10), 

pages 754-762, XP002167888 

ISSN: 0014-2956 

the whole document 

DOC. AGAINST INV. 67 (SEQ.IDs. 188, 70) 


1-14, 
25-27,35 


X 


WO 94 05794 A (MASSACHUSETTS INST 
TECHNOLOGY) 17 March 1994 (1994-03-17) 

page 13, paragraph 2 
examples 1,2 

DOC. AGAINST INV. 68 (SEQ.IDs. 189, 71) 

-/~ 


1-14, 

26-28, 

35-38 
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represented by SEQ.ID.122 or a domain thereof; a vector and 
a recombinant cell comprising said nucleic acid molecule; a 
nucleic acid probe for the detection of said nucleic acid 
molecule; a polypeptide as represented by SEQ.ID.122 or a 
fragment thereof; an antibody or antibody fragment having 
specific binding affinity to said polypeptide; a hybridoma 
which produces said antibody; methods for identifying a 
substance that modulates the kinase activity of said 
polypeptide; methods for treating a disease or disorder by 
administering such a substance; and methods for the 
detection of a kinase polypeptide as represented by 
SEQ.ID.122. 



2. Claims: Inventions 2-78: Claims 1-20, 

23-38 (all partially and as far as applicable) 



A nucleic acid molecule encoding a kinase with a polypeptide 
sequence selected from SEQ. IDs. 123-199, wherein invention 2 
is limited to SEQ. ID. 123, invention 3 is limited to SEQ. ID. 

124, , and invention 78 is limited to SEQ. ID. 199, or a 

domain thereof; a vector and a recombinant cell comprising 
said nucleic acid molecule; a nucleic acid probe for the 
detection of said nucleic acid molecule; a polypeptide as 
represented by a polypeptide sequence selected from 
SEQ. IDs. 123-199 or a fragment thereof; an antibody or 
antibody fragment having specific binding affinity to said 
polypeptide; a hybridoma which produces said antibody; 
methods for identifying a substance that modulates the 
kinase activity of said polypeptide; methods for treating a 
disease or disorder by administering such a substance; and 
methods for the detection of a kinase polypeptide 
represented by a polypeptide sequence selected from 
SEQ. IDs. 123-199. 



3. Claims: Invention 79: Claim 21 (completely) and Claims 
1-14,26-38 (all partially) 



A nucleic acid molecule encoding a kinase polypeptide as 
represented by SEQ. ID. 200 or a domain thereof; a vector and 
a recombinant cell comprising said nucleic acid molecule; a 
nucleic acid probe for the detection of said nucleic acid 
molecule; a polypeptide as represented by SEQ. ID. 200 or a 
fragment thereof; an antibody or antibody fragment having 
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specific binding affinity to said polypeptide; a hybridoma 
which produces said antibody; methods for identifying a 
substance that modulates the kinase activity of said 
polypeptide; methods for treating a disease or disorder by 
administering such a substance; and methods for the 
detection of a kinase polypeptide as represented by 
SEQ. ID. 200. 



4. Claims: Invention 80: Claim 22 (completely) and Claims 
1-14,26-38 (all partially) 



A nucleic acid molecule encoding a kinase polypeptide as 
represented by SEQ. ID. 201 or a domain thereof; a vector and 
a recombinant cell comprising said nucleic acid molecule; a 
nucleic acid probe for the detection of said nucleic acid 
molecule; a polypeptide as represented by SEQ.ID.201 or a 
fragment thereof; an antibody or antibody fragment having 
specific binding affinity to said polypeptide; a hybridoma 
which produces said antibody; methods for identifying a 
substance that modulates the kinase activity of said 
polypeptide; methods for treating a disease or disorder by 
administering such a substance; and methods for the 
detection of a kinase polypeptide as represented by 
SEQ.ID.201. 



5. Claims: Inventions 81-121: Claims 1-20, 

23-38 (all partially and as far as applicable) 



A nucleic acid molecule encoding a kinase with a polypeptide 
sequence selected from SEQ. IDs .202-242, wherein invention 81 
is limited to SEQ. ID. 202, invention 82 is limited to 

SEQ. ID. 203, , and invention 121 is limited to 

SEQ. ID. 242, or a domain thereof; a vector and a recombinant 
cell comprising said nucleic acid molecule; a nucleic acid 
probe for the detection of said nucleic acid molecule; a 
polypeptide as represented by a polypeptide sequence 
selected from SEQ. IDs .202-242 or a fragment thereof; an 
antibody or antibody fragment having specific binding 
affinity to said polypeptide; a hybridoma which produces 
said antibody; methods for identifying a substance that 
modulates the kinase activity of said polypeptide; methods 
for treating a disease or disorder by administering such a 
substance; and methods for the detection of a kinase 
polypeptide represented by a polypeptide sequence selected 
from SEQ. IDs. 202-242. 



6. Claims: Inventions 122-136: Claims 15-20, 

23-25 (all partially) and claims 1-14, 
26-38 (if applicable) 
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A nucleic acid molecule encoding a kinase polypeptide as 
represented by a 'gene name' selected from ^980090' , 
'AA045601\ 'AA297313', 'N23936', , 5R72-18-1\ '5R79-54-1', 
'5R65-16-1', 'AA065538', 'H17727', 'W08549', 'AA430250', 
'AA139478', 'R87679', 'W65887', 'AA948538', '5R69-23-3', and 
'5R69-26-2', wherein invention 122 is limited to 'AA980090', 

invention 123 is limited to 'AA04560r, and invention 

136 is limited to '5R69-26-2', or a domain thereof; a vector 
and a recombinant cell comprising said nucleic acid 
molecule; a nucleic acid probe for the detection of said 
nucleic acid molecule; a polypeptide encoded by said nucleic 
acid molecule or a fragment thereof; an antibody or antibody 
fragment having specific binding affinity to said 
polypeptide; a hybridoma which produces said antibody; 
methods for identifying a substance that modulates the 
kinase activity of said polypeptide; methods for treating a 
disease or disorder by administering such a substance; and 
methods for the detection of a said polypeptide. 
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Continuation of Box 1.2 
Claims Nos. : 31-34 



The search was based on the sequence listing furnished in computer 
readable form, the numbering of which differs from the numbering in the 
figures. 

Claims 31-34 refer to a 'substance that modulates the activity of a 
kinase' without giving a true technical characterization. Moreover, no 
such specific compounds are defined in the application. In consequence, 
the scope of said claims is ambiguous and vague, and their subject-matter 
is not sufficiently disclosed and supported (Art. 5 and 6 PCT). 
No search can be carried out for such purely speculative claims whose 
wording is, in fact, a mere recitation of the results to be achieved. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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